Abstract

We introduce a mixed generalized Dynkin game/stochastic control with $\mathcal{E}^f$-expectation in
a Markovian framework. We study both the case when the terminal reward function is Borelian
only and when it is continuous. By using the characterization of the value function of a
generalized Dynkin game via an associated doubly reflected BSDE (DRBSDE) first provided
in [16], we obtain that the value function of our problem coincides with the value function
of an optimization problem for DRBSDEs. Using this property, we establish a weak dynamic
programming principle by extending some results recently provided in [17]. We then show a
strong dynamic programming principle in the continuous case, which cannot be derived from
the weak one. In particular, we have to prove that the value function of the problem is
continuous with respect to time t, which requires some technical tools of stochastic analysis and new
results on DRBSDEs. We finally study the links between our mixed problem and generalized
Hamilton-Jacobi-Bellman variational inequalities in both cases.

Key-words: generalized Dynkin games, Markovian stochastic control, mixed stochastic
control/Dynkin game with nonlinear expectation, doubly reflected BSDEs, dynamic programming
principles, generalized Hamilton-Jacobi-Bellman variational inequalities, viscosity solution.
1 Introduction

In this paper, we study a new mixed stochastic control/optimal stopping game in a Markovian
framework which can be formulated as follows. We consider two actors A and B. Actor A, called
“controller/stopper”, can control a state process $X^\alpha$ through the selection of a control process $\alpha$,
which impacts both the drift and the volatility, and can also choose the duration of the “game”
via a stopping time $\tau$. Actor B, called “stopper”, can only decide when to stop the game via
another stopping time $\sigma$.

*Institut für Mathematik, Humboldt-Universität zu Berlin, Unter den Linden 6, 10099 Berlin, Germany, email:
roxana@ceremade.dauphine.fr. The research leading to these results has received funding from the Région
Île-de-France.

†LPMA, Université Paris 7 Denis Diderot, Boîte courrier 7012, 75251 Paris cedex 05, France, email:
quenez@math.univ-paris-diderot.fr

‡INRIA Paris, 3 rue Simone Iff, CS 42112, 75589 Paris Cedex 12, France, and Université Paris-Est, email:
agnes.sulem@inria.fr





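We denote by $\mathcal{T}$ the set of stopping times with values in $[0, T]$, where $T > 0$ is a fixed terminal
time, and by $\mathcal{A}$ the set of admissible control processes $\alpha$. For each $\alpha \in \mathcal{A}$, let $(X_t^\alpha)$ be a jump
diffusion process of the form

$$X_t^\alpha = x + \int_0^t b(X_u^\alpha, \alpha_u)\,du + \int_0^t \sigma(X_u^\alpha, \alpha_u)\,dW_u + \int_0^t \int_E \beta(X_{u^-}^\alpha, \alpha_u, e)\,\tilde N(du, de).$$

If A chooses a strategy $(\alpha, \tau)$ and B chooses a stopping time $\sigma$, the associated cost (or gain) is
defined by $g(X_T^\alpha)$ if A and B decide to stop at terminal time $T$, $h_1(X_\tau^\alpha)$ if A stops before B, and
$h_2(X_\sigma^\alpha)$ otherwise. More precisely, the cost is given by

$$I^\alpha(\tau, \sigma) := h_1(X_\tau^\alpha)\mathbf{1}_{\tau \le \sigma,\, \tau < T} + h_2(X_\sigma^\alpha)\mathbf{1}_{\sigma < \tau} + g(X_T^\alpha)\mathbf{1}_{\tau = \sigma = T}. \tag{1.1}$$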
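As an illustration only, the three cases of the cost (1.1) can be evaluated along a single path as in the sketch below. The function name `payoff` and the interface (the path passed as a callable `X`, the rewards as functions of the state) are assumptions made for this example and do not come from the paper.

```python
def payoff(tau, sigma, T, h1, h2, g, X):
    """Cost I(tau, sigma) of (1.1) along one path.

    X is a callable t -> state value (hypothetical interface);
    h1, h2, g are the reward functions of the state.
    """
    if tau == T and sigma == T:
        return g(X(T))        # both actors wait until maturity
    if tau <= sigma:          # here tau < T necessarily
        return h1(X(tau))     # A stops first (ties before T go to A)
    return h2(X(sigma))       # B stops strictly before A
```

Note that exactly one of the three indicators in (1.1) is equal to 1 for any pair of stopping times, which is why the branches are mutually exclusive.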

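If the strategy of A is given by $(\alpha, \tau)$, the stopper B wants to choose $\sigma$ in order to minimize
the expected cost evaluated under a nonlinear expectation. This nonlinear expectation, denoted
by $\mathcal{E}^\alpha$, is defined via a BSDE with jumps with a driver $f^\alpha$ which may depend on the control $\alpha$.
The minimal expected cost for B is then given by $\inf_{\sigma \in \mathcal{T}} \mathcal{E}^\alpha_{0, \tau \wedge \sigma}[I^\alpha(\tau, \sigma)]$. The aim of actor A is to
maximize this quantity over all choices of $(\alpha, \tau)$, which leads to the following mixed optimization
problem:

$$\sup_{(\alpha, \tau) \in \mathcal{A} \times \mathcal{T}} \ \inf_{\sigma \in \mathcal{T}} \ \mathcal{E}^\alpha_{0, \tau \wedge \sigma}[I^\alpha(\tau, \sigma)]. \tag{1.2}$$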

In the special case when the first player can only act on the duration $\tau$ of the game (i.e. when
there is no control $\alpha$), this problem reduces to a generalized Dynkin game that we have introduced
in [16]. It is proved there that the value function at 0, given by

$$\sup_{\tau \in \mathcal{T}} \inf_{\sigma \in \mathcal{T}} \mathcal{E}_{0, \tau \wedge \sigma}\big[h_1(X_\tau)\mathbf{1}_{\tau \le \sigma,\, \tau < T} + h_2(X_\sigma)\mathbf{1}_{\sigma < \tau} + g(X_T)\mathbf{1}_{\tau = \sigma = T}\big], \tag{1.3}$$

is characterized via a doubly reflected BSDE.

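Note that Problem (1.2) can be seen as a mixed generalized Dynkin game/stochastic control
problem since

$$\sup_{(\alpha, \tau) \in \mathcal{A} \times \mathcal{T}} \inf_{\sigma \in \mathcal{T}} \mathcal{E}^\alpha_{0, \tau \wedge \sigma}[I^\alpha(\tau, \sigma)] = \sup_{\alpha \in \mathcal{A}} \Big( \sup_{\tau \in \mathcal{T}} \inf_{\sigma \in \mathcal{T}} \mathcal{E}^\alpha_{0, \tau \wedge \sigma}[I^\alpha(\tau, \sigma)] \Big). \tag{1.4}$$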


The control $\alpha$ can then be interpreted as an ambiguity parameter on the model which affects both
the drift and the volatility of the underlying state process.

Using the characterization of the solution of a DRBSDE as the value function of a generalized 
Dynkin game we have provided in [16], Problem (1.4) corresponds to an optimization problem on 
DRBSDEs with jumps. 


In this paper, we first give a weak dynamic programming principle for Problem (1.2) when g is 
assumed to be Borelian only, which follows from some fine results recently obtained in [17] together 
with some properties of DRBSDEs. We then focus on the continuous case for which we prove 
a strong dynamic programming principle. We stress that it cannot be directly derived from the 
weak dynamic programming principle. To this aim, we show in particular the continuity of the 
value function with respect to time t, which requires some refined properties of doubly reflected 
BSDEs with jumps. The last part of the paper is devoted to the relation of the value function with 
generalized Hamilton-Jacobi-Bellman variational inequalities (HJBVIs). We prove that the value 
function is a weak viscosity solution of the HJBVIs in the irregular case, and a classical viscosity
solution in the continuous case. Uniqueness is obtained under additional assumptions. In terms of
PDEs, our result provides a probabilistic interpretation of nonlinear HJBVIs.

This work completes that of Buckdahn-Li [10], who have studied a related optimization
problem for doubly reflected BSDEs (of the form sup inf) in the case of a Brownian filtration and
a continuous reward g. Unlike our approach, they do not use a dynamic programming principle to
show that the value function is a viscosity solution of a generalized HJB equation.

The paper is organized as follows. In Section 2, we introduce the mixed generalized Dynkin
game/stochastic control problem. In Section 3, we provide some preliminary properties for doubly
reflected BSDEs with jumps. In Section 4, we prove the dynamic programming principles both in
the irregular and the regular case. In Section 5, we derive that the value function of our problem
is a weak (respectively classical) viscosity solution of some generalized HJBVIs in the irregular
(respectively regular) case. In Section 6, using the results obtained in the previous sections, we
provide some additional results for the value function in the discontinuous case. In the Appendix,
we provide some complementary properties which are used in the paper.

Related literature on games and mixed control problems with stopping times. Classical
Markovian Dynkin games (with linear expectation) have been studied in particular by
Bensoussan-Friedman [4], and by Bismut [6], Alario-Nazaret et al. [1], Kobylanski et al. [28] in a non Markovian
framework. See also e.g. Hamadène-Lepeltier [21] for the study of mixed classical Dynkin games.
Links between Dynkin games and doubly reflected BSDEs have been provided in the classical case
(see e.g. Cvitanić-Karatzas [13], Hamadène and Hassani [20], Hamadène and Wang [23], Hamadène
and Ouknine [22]), and extended to generalized Dynkin games (that is with nonlinear expectation)
in [16].

Controller/stopper games are special cases of Problem (1.2) when player A is only a controller
(no stopping time $\tau$). They have been studied in the case of linear expectation by e.g.
Karatzas-Zamfirescu [25], Bayraktar-Huang [2] and Choukroun et al. [11].

Finally, mixed optimal stopping/stochastic control problems are special cases of Problem (1.2)
when there is no player B. There is then no more game aspect. In the particular case of linear
expectation, we refer to Bensoussan-Lions [3], Øksendal-Sulem [29], and Bouchard-Touzi [7] who
provided a weak dynamic programming principle when the value function is irregular. This result
has been extended to the $\mathcal{E}^f$-expectation case in [17], where links between the value function and
generalized HJBVIs are provided under very weak assumptions on the terminal cost (or reward)
function.

We have seen above that Problem (1.4) is related to an optimization problem on DRBSDEs.
When the terminal reward map g is continuous, some optimization problems relative to BSDEs (see
e.g. Peng [30]), to RBSDEs (see Buckdahn-Li [9]) or DRBSDEs (Buckdahn-Li [10] in the Brownian
case) have been studied in the literature.

Motivating applications in mathematical finance. Links between classical Dynkin games
and game options have been provided by e.g. Kifer [26], Kifer and Yu [27], Hamadène [19]. Recall
that a game contingent claim is a contract between a seller and a buyer which allows the seller
to cancel it at a stopping time $\sigma \in \mathcal{T}$ and the buyer to exercise it at any time $\tau \in \mathcal{T}$. The
process X may be interpreted as the price process of the underlying asset. If the buyer (resp. the
seller) exercises (resp. cancels) at maturity time T, then the seller pays the amount $g(X_T)$ to the
buyer. If the buyer exercises at time $\tau < T$ before the seller cancels, then the seller pays the buyer
the amount $h_1(X_\tau)$, but if the seller cancels before the buyer exercises, then he pays the amount
$h_2(X_\sigma)$ to the buyer at the cancellation time $\sigma$. The difference $h_2(X_\sigma) - h_1(X_\sigma) \ge 0$ is interpreted
as a penalty that the seller pays to the buyer for the cancellation of the contract. In other terms, if
the seller (resp. the buyer) selects a cancellation time $\sigma$ (resp. an exercise time $\tau$), the seller pays
to the buyer at time $\tau \wedge \sigma$ the payoff

$$I(\tau, \sigma) := h_1(X_\tau)\mathbf{1}_{\tau \le \sigma,\, \tau < T} + h_2(X_\sigma)\mathbf{1}_{\sigma < \tau} + g(X_T)\mathbf{1}_{\tau = \sigma = T}.$$

In a perfect market model, there exists a (unique) fair price for a game option, which can be
characterized as the value function of a Dynkin game of the form

$$\sup_{\tau \in \mathcal{T}} \inf_{\sigma \in \mathcal{T}} E[I(\tau, \sigma)] = \sup_{\tau \in \mathcal{T}} \inf_{\sigma \in \mathcal{T}} E\big[h_1(X_\tau)\mathbf{1}_{\tau \le \sigma,\, \tau < T} + h_2(X_\sigma)\mathbf{1}_{\sigma < \tau} + g(X_T)\mathbf{1}_{\tau = \sigma = T}\big],$$

where the expectation E is taken under the risk-neutral measure, and X may be interpreted as
the price process of the underlying asset (see [26], [27] and [19]). Recently, we have generalized in
[18] this result to the case of nonlinear pricing by using the results on generalized Dynkin games
provided in [16]. The fair price of the game option is then of the form (1.3).
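To make the classical (linear expectation) Dynkin game value concrete, here is a minimal discrete-time sketch on a recombining binomial tree. It is a toy illustration under assumptions not made in the paper: time is discrete, the expectation is linear with a fixed up-probability `p`, and the name `dynkin_value` and its data layout are hypothetical. It relies on the classical backward recursion $V_n = g$, $V_k = \min(h_2, \max(h_1, E[V_{k+1} \mid \mathcal{F}_k]))$, valid when $h_1 \le h_2$.

```python
def dynkin_value(h1, h2, g, p=0.5):
    """Value at the root of a discrete-time Dynkin game on a binomial tree.

    h1[k][j], h2[k][j]: payoffs at step k, node j (j = number of up-moves),
    for k = 0..n-1, with h1 <= h2; g[j]: terminal payoff at step n;
    p: up-probability of the (linear) pricing measure.
    """
    n = len(g) - 1
    v = list(g)                       # terminal layer: V_n = g
    for k in range(n - 1, -1, -1):
        # conditional expectation of the next layer at each node of step k
        cont = [p * v[j + 1] + (1 - p) * v[j] for j in range(k + 1)]
        # maximizer may stop for h1, minimizer may cancel for h2
        v = [min(h2[k][j], max(h1[k][j], cont[j])) for j in range(k + 1)]
    return v[0]
```

When the lower payoff is very small and the upper payoff very large, neither player stops early and the recursion reduces to a plain expectation of the terminal payoff.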

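In the presence of constraints and ambiguity on the model represented by an “ambiguity”
parameter $\alpha \in \mathcal{A}$, the controlled payoff is given by (1.1), and the nonlinear expectation by
$\mathcal{E}^\alpha$. In this case, there is a set of possible fair prices for the game option which can be written as

$$\sup_{\tau \in \mathcal{T}} \inf_{\sigma \in \mathcal{T}} \mathcal{E}^\alpha_{0, \tau \wedge \sigma}[I^\alpha(\tau, \sigma)] = \sup_{\tau \in \mathcal{T}} \inf_{\sigma \in \mathcal{T}} \mathcal{E}^\alpha_{0, \tau \wedge \sigma}\big[h_1(X_\tau^\alpha)\mathbf{1}_{\tau \le \sigma,\, \tau < T} + h_2(X_\sigma^\alpha)\mathbf{1}_{\sigma < \tau} + g(X_T^\alpha)\mathbf{1}_{\tau = \sigma = T}\big],$$

where $\alpha \in \mathcal{A}$. The supremum of these possible prices over $\mathcal{A}$ then corresponds to the value function
of the mixed generalized Dynkin game/stochastic control problem (1.4).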


2 Mixed stochastic control/generalized Dynkin game 

Let $T > 0$ be fixed. We consider the product space $\Omega := \Omega_W \otimes \Omega_N$, where $\Omega_W := C([0,T])$ is the
Wiener space, that is the set of continuous functions $\omega^1$ from $[0, T]$ into $\mathbb{R}^p$ such that $\omega^1(0) = 0$,
and $\Omega_N := D([0, T])$ is the Skorohod space of right-continuous with left limits (RCLL) functions $\omega^2$
from $[0,T]$ into $\mathbb{R}^d$, such that $\omega^2(0) = 0$. Recall that $\Omega$ is a Polish space for the Skorohod
topology. Here $p, d \ge 1$, but, for notational simplicity, we shall consider only $\mathbb{R}$-valued functions,
that is the case $p = d = 1$.

Let $B = (B^1, B^2)$ be the canonical process defined for each $t \in [0,T]$ and each $\omega = (\omega^1, \omega^2)$ by
$B_t^i(\omega) = B_t^i(\omega^i) := \omega_t^i$ for $i = 1, 2$. Let us denote the first coordinate process $B^1$ by $W$. Let $P^W$ be
the probability measure on $(\Omega_W, \mathcal{B}(\Omega_W))$ such that $W$ is a Brownian motion. Here $\mathcal{B}(\Omega_W)$ denotes
the Borelian $\sigma$-algebra on $\Omega_W$.

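Set $E := \mathbb{R}^n \setminus \{0\}$ equipped with its Borelian $\sigma$-algebra $\mathcal{B}(E)$, where $n \ge 1$, and a $\sigma$-finite positive
measure $\nu$ such that $\int_E (1 \wedge |e|)\,\nu(de) < \infty$. We define the jump random measure $N$ as follows: for
each $t > 0$ and each $B \in \mathcal{B}(E)$,

$$N(\omega, [0,t] \times B) = N(\omega^2, [0,t] \times B) := \sum_{0 < s \le t} \mathbf{1}_{\{\Delta \omega_s^2 \in B\}}. \tag{2.1}$$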
Let $P^N$ be the probability measure on $(\Omega_N, \mathcal{B}(\Omega_N))$ such that $N$ is a Poisson random measure
with compensator $\nu(de)dt$ and such that $B_t^2 = \sum_{0 < s \le t} \Delta B_s^2$ a.s. Note that the sum of jumps is
well defined up to a $P^N$-null set. We denote by $\tilde N(dt, de) := N(dt, de) - \nu(de)dt$ the compensated
Poisson measure. The space $\Omega$ is equipped with its Borelian $\sigma$-algebra $\mathcal{B}(\Omega)$ and the probability
measure $P := P^W \otimes P^N$. Let $\mathbb{F} := (\mathcal{F}_t)_{t \le T}$ be the filtration generated by $W$ and $N$ completed with
respect to $\mathcal{B}(\Omega)$ and $P$ (see e.g. [24] p.3 for the definition of a completed filtration). Note that $\mathcal{F}_T$ is
equal to the completion of the $\sigma$-algebra $\mathcal{B}(\Omega)$ with respect to $P$, and $\mathcal{F}_0$ is the $\sigma$-algebra generated
by $P$-null sets. Let $\mathcal{P}$ be the predictable $\sigma$-algebra on $\Omega \times [0, T]$ associated with the filtration $\mathbb{F}$.

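We introduce the following spaces:

• $\mathbb{H}_T^2$ (also denoted $\mathbb{H}^2$): the set of real-valued predictable processes $(Z_t)$ with $E \int_0^T Z_s^2\,ds < \infty$.

• $\mathcal{S}^2$: the set of real-valued RCLL adapted processes $(\varphi_s)$ with $E[\sup_{0 \le s \le T} \varphi_s^2] < \infty$.

• $L_\nu^2$: the set of measurable functions $l : (E, \mathcal{K}) \to (\mathbb{R}, \mathcal{B}(\mathbb{R}))$ such that
$\|l\|_\nu^2 := \int_E l^2(e)\,\nu(de) < \infty$. The set $L_\nu^2$ is a Hilbert space equipped with the scalar product
$\langle l, l' \rangle_\nu := \int_E l(e)\,l'(e)\,\nu(de)$ for all $(l, l') \in L_\nu^2 \times L_\nu^2$.

• $\mathbb{H}_\nu^2$: the set of predictable real-valued processes $(k_t(\cdot))$ with $E \int_0^T \|k_s\|_{L_\nu^2}^2\,ds < \infty$.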

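Let $\mathbf{A}$ be a nonempty closed subset of $\mathbb{R}^p$. Let $\mathcal{A}$ be the set of controls, defined as the set of
predictable processes $\alpha$ valued in $\mathbf{A}$. For each $\alpha \in \mathcal{A}$, initial time $t \in [0, T]$ and initial condition $x$
in $\mathbb{R}$, let $(X_s^{\alpha,t,x})_{t \le s \le T}$ be the unique $\mathbb{R}$-valued solution in $\mathcal{S}^2$ of the stochastic differential equation:

$$X_s^{\alpha,t,x} = x + \int_t^s b(X_r^{\alpha,t,x}, \alpha_r)\,dr + \int_t^s \sigma(X_r^{\alpha,t,x}, \alpha_r)\,dW_r + \int_t^s \int_E \beta(X_{r^-}^{\alpha,t,x}, \alpha_r, e)\,\tilde N(dr, de), \tag{2.2}$$

where $b, \sigma : \mathbb{R} \times \mathbf{A} \to \mathbb{R}$ are Lipschitz continuous with respect to $x$ and $\alpha$, and $\beta : \mathbb{R} \times \mathbf{A} \times E \to \mathbb{R}$
is a bounded measurable function such that for some constant $C > 0$, and for all $e \in E$,

$$|\beta(x, \alpha, e)| \le C\,\Psi(e), \quad x \in \mathbb{R},\ \alpha \in \mathbf{A},$$
$$|\beta(x, \alpha, e) - \beta(x', \alpha', e)| \le C(|x - x'| + |\alpha - \alpha'|)\,\Psi(e), \quad x, x' \in \mathbb{R},\ \alpha, \alpha' \in \mathbf{A},$$

where $\Psi \in L_\nu^1 \cap L_\nu^2$.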
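For readers who want to experiment numerically, a controlled jump diffusion of the type (2.2) can be approximated by an Euler scheme. The sketch below is not part of the paper and makes an extra simplifying assumption: the Lévy measure is finite, $\nu = \lambda \times$ (mark law), so that jumps arrive at rate `jump_rate` and the compensator is explicit. The function name and every parameter name are hypothetical.

```python
import math
import random

def euler_jump_diffusion(x0, t, T, n_steps, alpha, b, sigma, beta,
                         jump_rate, sample_mark, beta_mean, rng):
    """Euler path of (2.2) on [t, T] when nu = jump_rate * (law of sample_mark).

    alpha: callable s -> control value; b, sigma: callables (x, a);
    beta: callable (x, a, e); beta_mean(x, a) = E[beta(x, a, e)] under the
    mark law, needed for the compensator of the jump integral.
    """
    dt = (T - t) / n_steps
    x, path = x0, [x0]
    for i in range(n_steps):
        a = alpha(t + i * dt)
        dw = rng.gauss(0.0, math.sqrt(dt))
        # Count the jumps in the step using exponential inter-arrival times.
        n_jumps = 0
        if jump_rate > 0:
            s = rng.expovariate(jump_rate)
            while s < dt:
                n_jumps += 1
                s += rng.expovariate(jump_rate)
        jumps = sum(beta(x, a, sample_mark(rng)) for _ in range(n_jumps))
        # Subtract the compensator so the jump term is a martingale increment.
        compensator = jump_rate * beta_mean(x, a) * dt
        x = x + b(x, a) * dt + sigma(x, a) * dw + jumps - compensator
        path.append(x)
    return path
```

The infinite-activity case allowed by the condition $\int_E (1 \wedge |e|)\,\nu(de) < \infty$ would require truncating small jumps, which this sketch does not attempt.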

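The criterion of our mixed control problem, depending on $\alpha$, is defined via a BSDE with a
Lipschitz driver function $f$ satisfying the following conditions:

(i) $f : \mathbf{A} \times [0, T] \times \mathbb{R}^3 \times L_\nu^2 \to (\mathbb{R}, \mathcal{B}(\mathbb{R}))$ is $\mathcal{B}(\mathbf{A}) \otimes \mathcal{B}([0, T]) \otimes \mathcal{B}(\mathbb{R}^3) \otimes \mathcal{B}(L_\nu^2)$-measurable,

(ii) $|f(\alpha, t, x, 0, 0, 0)| \le C(1 + |x|^p)$, $\forall \alpha \in \mathbf{A}$, $t \in [0, T]$, $x \in \mathbb{R}$, where $p \in \mathbb{N}^*$,

(iii) $|f(\alpha, t, x, y, z, k) - f(\alpha', t, x', y', z', k')| \le C(|\alpha - \alpha'| + |x - x'| + |y - y'| + |z - z'| + \|k - k'\|_{L_\nu^2})$,
$\forall t \in [0, T]$, $x, x', y, y', z, z' \in \mathbb{R}$, $k, k' \in L_\nu^2$, $\alpha, \alpha' \in \mathbf{A}$,

(iv) $f(\alpha, t, x, y, z, k_2) - f(\alpha, t, x, y, z, k_1) \ge \langle \gamma(\alpha, t, x, y, z, k_1, k_2),\, k_2 - k_1 \rangle_\nu$, $\forall t, x, y, z, k_1, k_2, \alpha$,

where $\gamma : \mathbf{A} \times [0, T] \times \mathbb{R}^3 \times (L_\nu^2)^2 \to L_\nu^2$ is $\mathcal{B}(\mathbf{A}) \otimes \mathcal{B}([0, T]) \otimes \mathcal{B}(\mathbb{R}^3) \otimes \mathcal{B}((L_\nu^2)^2) / \mathcal{B}(L_\nu^2)$-measurable,
$|\gamma(\cdot)(e)| \le \Psi(e)$ and $\gamma(\cdot)(e) \ge -1$ $d\nu(e)$-a.s., where $\Psi \in L_\nu^2$.

Condition (iv) allows us to apply the comparison theorem for non reflected BSDEs, reflected
and doubly reflected BSDEs with jumps (see [31], [32] and [16]).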
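Let $(t, x) \in [0, T] \times \mathbb{R}$ and $\alpha \in \mathcal{A}$. Let $\mathcal{E}^{f^{\alpha,t,x}}$ (also denoted by $\mathcal{E}^{\alpha,t,x}$) be the nonlinear conditional
expectation associated with $f^{\alpha,t,x}$, defined for each stopping time $S$ and for each $\zeta \in L^2(\mathcal{F}_S)$ as:

$$\mathcal{E}_{r,S}^{\alpha,t,x}[\zeta] := y_r^{\alpha,t,x}, \quad t \le r \le S,$$

where $(y_r^{\alpha,t,x})_{t \le r \le S}$ is the solution in $\mathcal{S}^2$ of the BSDE associated with driver $f^{\alpha,t,x}(r, y, z, k) :=
f(\alpha_r, r, X_r^{\alpha,t,x}, y, z, k)$, terminal time $S$ and terminal condition $\zeta$, that is satisfying the dynamics

$$-dy_r^{\alpha,t,x} = f(\alpha_r, r, X_r^{\alpha,t,x}, y_r^{\alpha,t,x}, z_r^{\alpha,t,x}, k_r^{\alpha,t,x})\,dr - z_r^{\alpha,t,x}\,dW_r - \int_E k_r^{\alpha,t,x}(e)\,\tilde N(dr, de), \tag{2.3}$$

with $y_S^{\alpha,t,x} = \zeta$, where $z^{\alpha,t,x}, k^{\alpha,t,x}$ are the associated processes in $\mathbb{H}^2$ and $\mathbb{H}_\nu^2$ respectively.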

For all $(t, x) \in [0, T] \times \mathbb{R}$ and all controls $\alpha \in \mathcal{A}$, we define the barriers for $i = 1, 2$ by $h_i(s, X_s^{\alpha,t,x})$
for $t \le s < T$, and the terminal condition by $g(X_T^{\alpha,t,x})$, where

(i) $g : \mathbb{R} \to \mathbb{R}$ is Borelian,

(ii) $h_1 : [0, T] \times \mathbb{R} \to \mathbb{R}$ and $h_2 : [0, T] \times \mathbb{R} \to \mathbb{R}$ are functions which are Lipschitz continuous with
respect to $x$ uniformly in $t$, and continuous with respect to $t$ on $[0,T]$, with $h_1 \le h_2$,

(iii) $h_1$ (or $h_2$) is $C^{1,2}$ with bounded derivatives,

(iv) $|h_1(t,x)| + |h_2(t,x)| + |g(x)| \le C(1 + |x|^p)$, $\forall t \in [0,T]$, $x \in \mathbb{R}$, with $p \in \mathbb{N}$.

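Let $\mathcal{T}$ be the set of stopping times with values in $[0,T]$. Suppose the initial time is equal to
0. For each initial condition $x \in \mathbb{R}$, we consider the following mixed generalized Dynkin game and
stochastic control problem:

$$u(0, x) := \sup_{\alpha \in \mathcal{A}} \sup_{\tau \in \mathcal{T}} \inf_{\sigma \in \mathcal{T}} \mathcal{E}_{0, \tau \wedge \sigma}^{\alpha,0,x}\big[h_1(\tau, X_\tau^{\alpha,0,x})\mathbf{1}_{\tau \le \sigma,\, \tau < T} + h_2(\sigma, X_\sigma^{\alpha,0,x})\mathbf{1}_{\sigma < \tau} + g(X_T^{\alpha,0,x})\mathbf{1}_{\tau \wedge \sigma = T}\big]. \tag{2.4}$$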

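We now make the problem dynamic. We define, for $t \in [0, T]$ and each $\omega \in \Omega$, the $t$-translated
path $\omega^t = (\omega_s^t)_{s \ge t} := (\omega_s - \omega_t)_{s \ge t}$. Note that $(\omega_s^{1,t})_{s \ge t} := (\omega_s^1 - \omega_t^1)_{s \ge t}$ corresponds to the realizations
of the translated Brownian motion $W^t := (W_s - W_t)_{s \ge t}$ and that the translated Poisson random
measure $N^t := N(]t, s], \cdot)_{s \ge t}$ can be expressed in terms of $(\omega_s^{2,t})_{s \ge t} := (\omega_s^2 - \omega_t^2)_{s \ge t}$ similarly to
(2.1). Let $\mathbb{F}^t = (\mathcal{F}_s^t)_{t \le s \le T}$ be the filtration generated by $W^t$ and $N^t$ completed with respect to $\mathcal{B}(\Omega)$
and $P$. Note that for each $s \in [t,T]$, $\mathcal{F}_s^t$ is the $\sigma$-algebra generated by $W_r^t$, $N_r^t$, $t \le r \le s$ and
$\mathcal{F}_0$. Recall also that we have a martingale representation theorem for $\mathbb{F}^t$-martingales as stochastic
integrals with respect to $W^t$ and $\tilde N^t$.

Let us denote by $\mathcal{T}_t^t$ the set of stopping times with respect to $\mathbb{F}^t$ with values in $[t, T]$. Let $\mathcal{P}^t$
be the predictable $\sigma$-algebra on $\Omega \times [t,T]$ equipped with the filtration $\mathbb{F}^t$.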

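We introduce the following spaces of processes. Let $t \in [0, T]$.

Let $\mathbb{H}_t^2$ be the set of $\mathcal{P}^t$-measurable processes $Z$ on $\Omega \times [t,T]$ such that $\|Z\|_{\mathbb{H}_t^2}^2 := E[\int_t^T Z_u^2\,du] < \infty$.
Let $\mathbb{H}_{t,\nu}^2$ be the set of $\mathcal{P}^t$-measurable processes $K$ on $\Omega \times [t, T]$ with $\|K\|_{\mathbb{H}_{t,\nu}^2}^2 := E[\int_t^T \|K_u\|_\nu^2\,du] < \infty$.

Let $\mathcal{S}_t^2$ be the set of $\mathbb{R}$-valued RCLL processes $\varphi$ on $\Omega \times [t, T]$, $\mathbb{F}^t$-adapted, with $E[\sup_{t \le s \le T} \varphi_s^2] < \infty$.

Let $\mathcal{A}_t^t$ be the set of controls $\alpha : \Omega \times [t, T] \to \mathbf{A}$, which are $\mathcal{P}^t$-measurable. For each initial time
$t$ and each initial condition $x$, the value function is defined by:

$$u(t, x) := \sup_{\alpha \in \mathcal{A}_t^t} \sup_{\tau \in \mathcal{T}_t^t} \inf_{\sigma \in \mathcal{T}_t^t} \mathcal{E}_{t, \tau \wedge \sigma}^{\alpha,t,x}\big[h_1(\tau, X_\tau^{\alpha,t,x})\mathbf{1}_{\tau \le \sigma,\, \tau < T} + h_2(\sigma, X_\sigma^{\alpha,t,x})\mathbf{1}_{\sigma < \tau} + g(X_T^{\alpha,t,x})\mathbf{1}_{\tau \wedge \sigma = T}\big]. \tag{2.5}$$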
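Note that since $\alpha$, $\tau$ and $\sigma$ depend only on $\omega^t$, the SDE (2.2) and the BSDE (2.3) can be solved
with respect to the translated Brownian motion $(W_s - W_t)_{s \ge t}$ and the translated Poisson random
measure $N(]t, s], \cdot)_{s \ge t}$. Hence the function $u$ is well defined as a deterministic function of $t$ and $x$.
For each $\alpha \in \mathcal{A}_t^t$, we introduce the function $u^\alpha$ defined as

$$u^\alpha(t, x) := \sup_{\tau \in \mathcal{T}_t^t} \inf_{\sigma \in \mathcal{T}_t^t} \mathcal{E}_{t, \tau \wedge \sigma}^{\alpha,t,x}\big[h_1(\tau, X_\tau^{\alpha,t,x})\mathbf{1}_{\tau \le \sigma,\, \tau < T} + h_2(\sigma, X_\sigma^{\alpha,t,x})\mathbf{1}_{\sigma < \tau} + g(X_T^{\alpha,t,x})\mathbf{1}_{\tau \wedge \sigma = T}\big].$$

We thus get

$$u(t, x) = \sup_{\alpha \in \mathcal{A}_t^t} u^\alpha(t, x). \tag{2.6}$$

Note that for all $\alpha$, $x$ and $t < T$, we have $h_1(t,x) \le u^\alpha(t,x) \le h_2(t,x)$, and hence $h_1(t,x) \le
u(t,x) \le h_2(t,x)$. Moreover, $u^\alpha(T,x) = u(T,x) = g(x)$.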
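The two bounds on $u^\alpha$ can be checked directly from the definition. The following is a short sketch of the argument, not taken from the paper; it uses only that the nonlinear expectation with terminal time $t$ is the identity, $\mathcal{E}_{t,t}^{\alpha,t,x}[\zeta] = \zeta$, and that $h_1 \le h_2$.

```latex
% Sketch, for t < T, of h_1(t,x) <= u^alpha(t,x) <= h_2(t,x).
% Write I(tau, sigma) for the payoff inside the nonlinear expectation.
\begin{align*}
\text{Lower bound: take } \tau = t.\ &\text{For every } \sigma \ge t:\quad
  \tau \wedge \sigma = t, \quad I(t, \sigma) = h_1(t, x), \\
&\text{so } u^\alpha(t,x) \ \ge\ \inf_{\sigma} \mathcal{E}_{t,t}^{\alpha,t,x}[h_1(t,x)] = h_1(t,x). \\[4pt]
\text{Upper bound: take } \sigma = t.\ &\text{For every } \tau \ge t:\quad
  I(\tau, t) = h_1(t,x)\mathbf{1}_{\tau = t} + h_2(t,x)\mathbf{1}_{\tau > t} \ \le\ h_2(t,x), \\
&\text{so } u^\alpha(t,x) \ \le\ \sup_{\tau} \mathcal{E}_{t,t}^{\alpha,t,x}[I(\tau, t)] \ \le\ h_2(t,x).
\end{align*}
```

Taking the supremum over $\alpha$ preserves both inequalities, which gives the corresponding bounds for $u$.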

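By Assumption (iii), $h_1$ (or $h_2$) is $C^{1,2}$ with bounded derivatives. It follows from
Proposition A.1 in the Appendix that for each $t \in [0,T]$ and for each $\alpha \in \mathcal{A}_t^t$, the processes $\xi_s^{\alpha,t,x} :=
h_1(s, X_s^{\alpha,t,x})\mathbf{1}_{s < T} + g(X_T^{\alpha,t,x})\mathbf{1}_{s = T}$ and $\zeta_s^{\alpha,t,x} := h_2(s, X_s^{\alpha,t,x})\mathbf{1}_{s < T} + g(X_T^{\alpha,t,x})\mathbf{1}_{s = T}$ satisfy
Mokobodzki's condition: there exist two nonnegative $\mathbb{F}^t$-supermartingales $H^{\alpha,t,x}$ and $H'^{\alpha,t,x}$ in $\mathcal{S}_t^2$ such
that

$$\xi_s^{\alpha,t,x} \le H_s^{\alpha,t,x} - H_s'^{\alpha,t,x} \le \zeta_s^{\alpha,t,x}, \quad t \le s \le T \ \text{a.s.} \tag{2.7}$$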

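By Theorem 4.7 in [16], for each $\alpha$, the value function $u^\alpha$ of the above generalized Dynkin game is
characterized as the solution of the doubly reflected BSDE associated with driver $f^{\alpha,t,x}$, barriers
$\xi_s^{\alpha,t,x} = h_1(s, X_s^{\alpha,t,x})$ and $\zeta_s^{\alpha,t,x} = h_2(s, X_s^{\alpha,t,x})$ for $s < T$, and terminal condition $g(X_T^{\alpha,t,x})$, that is

$$u^\alpha(t,x) = Y_t^{\alpha,t,x}, \tag{2.8}$$

where $(Y^{\alpha,t,x}, Z^{\alpha,t,x}, K^{\alpha,t,x}) \in \mathcal{S}_t^2 \times \mathbb{H}_t^2 \times \mathbb{H}_{t,\nu}^2$ is the solution of the doubly reflected BSDE:

$$\begin{cases}
Y_s^{\alpha,t,x} = g(X_T^{\alpha,t,x}) + \int_s^T f(\alpha_r, r, X_r^{\alpha,t,x}, Y_r^{\alpha,t,x}, Z_r^{\alpha,t,x}, K_r^{\alpha,t,x}(\cdot))\,dr + A_T^{1,\alpha,t,x} - A_s^{1,\alpha,t,x} \\
\qquad\qquad - A_T^{2,\alpha,t,x} + A_s^{2,\alpha,t,x} - \int_s^T Z_r^{\alpha,t,x}\,dW_r - \int_s^T \int_E K^{\alpha,t,x}(r, e)\,\tilde N(dr, de), \\
\xi_s^{\alpha,t,x} \le Y_s^{\alpha,t,x} \le \zeta_s^{\alpha,t,x}, \quad t \le s \le T \ \text{a.s.}, \\
A^{1,\alpha,t,x}, A^{2,\alpha,t,x} \text{ are RCLL nondecreasing predictable processes with } A_t^{1,\alpha,t,x} = A_t^{2,\alpha,t,x} = 0 \text{ and} \\
\int_t^T (Y_s^{\alpha,t,x} - \xi_s^{\alpha,t,x})\,dA_s^{1,c,\alpha,t,x} = 0 \ \text{a.s. and} \ \Delta A_s^{1,d,\alpha,t,x} = \Delta A_s^{1,d,\alpha,t,x}\,\mathbf{1}_{\{Y_{s^-}^{\alpha,t,x} = \xi_{s^-}^{\alpha,t,x}\}} \ \text{a.s.}, \\
\int_t^T (\zeta_s^{\alpha,t,x} - Y_s^{\alpha,t,x})\,dA_s^{2,c,\alpha,t,x} = 0 \ \text{a.s. and} \ \Delta A_s^{2,d,\alpha,t,x} = \Delta A_s^{2,d,\alpha,t,x}\,\mathbf{1}_{\{Y_{s^-}^{\alpha,t,x} = \zeta_{s^-}^{\alpha,t,x}\}} \ \text{a.s.}
\end{cases} \tag{2.9}$$

Here $A^{1,c,\alpha,t,x}$ (resp. $A^{2,c,\alpha,t,x}$) denotes the continuous part of $A^1$ (resp. $A^2$) and $A^{1,d,\alpha,t,x}$
(resp. $A^{2,d,\alpha,t,x}$) the discontinuous part. In the particular case when $h_1(T,x) \le g(x) \le h_2(T,x)$,
the obstacles $\xi^{\alpha,t,x}$ and $\zeta^{\alpha,t,x}$ satisfy, for all predictable stopping times $\tau$, $\xi_{\tau^-} \le \xi_\tau$ and $\zeta_{\tau^-} \ge \zeta_\tau$
a.s., which implies the continuity of $A^{1,\alpha,t,x}$ and $A^{2,\alpha,t,x}$ (see [16]).

Note that the doubly reflected BSDE (2.9) can be solved in $\mathcal{S}_t^2 \times \mathbb{H}_t^2 \times \mathbb{H}_{t,\nu}^2$ with respect to the
$t$-translated Brownian motion and the $t$-translated Poisson random measure.

In the following, for each $\alpha \in \mathcal{A}_t^t$, $Y_t^{\alpha,t,x}$ will be also denoted by $Y_{t,T}^{\alpha,t,x}[g(X_T^{\alpha,t,x})]$.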
Using (2.8), our initial optimization problem (2.5) can thus be reduced to an optimal control
problem for doubly reflected BSDEs:

$$u(t,x) = \sup_{\alpha \in \mathcal{A}_t^t} u^\alpha(t,x) = \sup_{\alpha \in \mathcal{A}_t^t} Y_{t,T}^{\alpha,t,x}[g(X_T^{\alpha,t,x})].$$

We now provide some new results on doubly reflected BSDEs, which will be used to prove the 
dynamic programming principles. 

3 Preliminary properties for doubly reflected BSDEs 

We show in a general non Markovian framework a continuity property and a Fatou lemma for 
doubly reflected BSDEs, where the limit involves both terminal condition and terminal time. 

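A function $f$ is said to be a Lipschitz driver if

$f : [0, T] \times \Omega \times \mathbb{R}^2 \times L_\nu^2 \to \mathbb{R}$, $(\omega, t, y, z, k(\cdot)) \mapsto f(\omega, t, y, z, k(\cdot))$ is $\mathcal{P} \otimes \mathcal{B}(\mathbb{R}^2) \otimes \mathcal{B}(L_\nu^2)$-measurable,
uniformly Lipschitzian with respect to $y, z, k(\cdot)$ and such that $f(\cdot, 0, 0, 0) \in \mathbb{H}^2$.

A Lipschitz driver $f$ is said to satisfy Assumption 3.1 if the following holds:

Assumption 3.1. $dP \otimes dt$-a.s. for each $(y, z, k_1, k_2) \in \mathbb{R}^2 \times (L_\nu^2)^2$,

$$f(t, y, z, k_1) - f(t, y, z, k_2) \ge \langle \gamma_t^{y,z,k_1,k_2},\, k_1 - k_2 \rangle_\nu,$$

with $\gamma : [0, T] \times \Omega \times \mathbb{R}^2 \times (L_\nu^2)^2 \to L_\nu^2$; $(\omega, t, y, z, k_1, k_2) \mapsto \gamma_t^{y,z,k_1,k_2}(\omega, \cdot)$ supposed to be
$\mathcal{P} \otimes \mathcal{B}(\mathbb{R}^2) \otimes \mathcal{B}((L_\nu^2)^2)$-measurable, and satisfying $dP \otimes dt$-a.s., for each $(y, z, k_1, k_2) \in \mathbb{R}^2 \times (L_\nu^2)^2$,
$\gamma_t^{y,z,k_1,k_2}(\cdot) \ge -1$ and $\|\gamma_t^{y,z,k_1,k_2}(\cdot)\|_\nu \le K$, where $K$ is a positive constant.

This assumption ensures the comparison theorem for BSDEs, reflected BSDEs and doubly
reflected BSDEs with jumps (see [31], [32] and [16]). It is satisfied if, for example, $f$ is of class $C^1$
with respect to $k$ such that $\nabla_k f$ is bounded (in $L_\nu^2$) and $\nabla_k f \ge -1$ (see [16] Proposition A.2).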

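We extend the definition of reflected and doubly reflected BSDEs when the terminal time
is a stopping time $\theta \in \mathcal{T}$ and the terminal condition is a random variable $\xi$ in $L^2(\mathcal{F}_\theta)$. Let $f$
be a given Lipschitz driver. Let $(\eta_t)$ be a given obstacle RCLL process in $\mathcal{S}^2$. The solution,
denoted $(Y_{\cdot,\theta}(\xi), Z_{\cdot,\theta}(\xi), k_{\cdot,\theta}(\xi))$, of the reflected BSDE associated with terminal time $\theta$, driver $f$,
obstacle $(\eta_s)_{s < \theta}$, and terminal condition $\xi$, is defined as the unique solution in $\mathcal{S}^2 \times \mathbb{H}^2 \times \mathbb{H}_\nu^2$ of the
reflected BSDE with terminal time $T$, driver $f(t, y, z, k)\mathbf{1}_{\{t \le \theta\}}$, terminal condition $\xi$ and obstacle
$\eta_t \mathbf{1}_{t < \theta} + \xi \mathbf{1}_{t \ge \theta}$. Note that $Y_{t,\theta}(\xi) = \xi$, $Z_{t,\theta}(\xi) = 0$, $k_{t,\theta}(\xi) = 0$ for $t \ge \theta$. Similarly, let $(\eta_t)$ and
$(\zeta_t)$ be given RCLL processes in $\mathcal{S}^2$ satisfying Mokobodzki's condition, that is, there exist two
nonnegative supermartingales $H$ and $H'$ in $\mathcal{S}^2$ such that

$$\eta_s \le H_s - H_s' \le \zeta_s, \quad 0 \le s \le T \ \text{a.s.}$$

The solution, denoted $(Y_{\cdot,\theta}(\xi), Z_{\cdot,\theta}(\xi), k_{\cdot,\theta}(\xi))$, of the doubly reflected BSDE associated with
terminal stopping time $\theta$, driver $f$, barriers $(\eta_s)_{s < \theta}$ and $(\zeta_s)_{s < \theta}$, and terminal condition $\xi$ is defined
as the unique solution in $\mathcal{S}^2 \times \mathbb{H}^2 \times \mathbb{H}_\nu^2$ of the doubly reflected BSDE with driver $f(t, y, z, k)\mathbf{1}_{\{t \le \theta\}}$,
terminal time $T$, terminal condition $\xi$ and barriers $\eta_t \mathbf{1}_{t < \theta} + \xi \mathbf{1}_{t \ge \theta}$ and $\zeta_t \mathbf{1}_{t < \theta} + \xi \mathbf{1}_{t \ge \theta}$. Note that
$Y_{t,\theta}(\xi) = \xi$, $Z_{t,\theta}(\xi) = 0$, $k_{t,\theta}(\xi) = 0$ for $t \ge \theta$.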

We first prove a continuity property for doubly reflected BSDEs where the limit involves both 
terminal condition and terminal time. 


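Proposition 3.2. [A continuity property for doubly reflected BSDEs] Let $T > 0$. Let $f$ be a
Lipschitz driver satisfying Assumption 3.1. Let $(\eta_t)$, $(\zeta_t)$ be two RCLL processes in $\mathcal{S}^2$, with $\eta \le \zeta$.
Let $(\theta^n)_{n \in \mathbb{N}}$ be a non increasing sequence of stopping times in
$\mathcal{T}$, converging a.s. to $\theta \in \mathcal{T}$ as $n$ tends to $\infty$. Let $(\xi^n)_{n \in \mathbb{N}}$ be a sequence of random variables such
that $E[\operatorname{ess\,sup}_n (\xi^n)^2] < +\infty$, and for each $n$, $\xi^n$ is $\mathcal{F}_{\theta^n}$-measurable. Suppose that for each $n$, the
processes $\eta_t \mathbf{1}_{\{t < \theta^n\}} + \xi^n \mathbf{1}_{\{t \ge \theta^n\}}$ and $\zeta_t \mathbf{1}_{\{t < \theta^n\}} + \xi^n \mathbf{1}_{\{t \ge \theta^n\}}$ satisfy Mokobodzki's condition. Suppose
that $\xi^n$ converges a.s. to an $\mathcal{F}_\theta$-measurable random variable $\xi$ as $n$ tends to $\infty$. Suppose that

$$\eta_\theta \le \xi \le \zeta_\theta \ \text{a.s.}, \tag{3.1}$$

and that the processes $\eta_t \mathbf{1}_{\{t < \theta\}} + \xi \mathbf{1}_{\{t \ge \theta\}}$ and $\zeta_t \mathbf{1}_{\{t < \theta\}} + \xi \mathbf{1}_{\{t \ge \theta\}}$ satisfy Mokobodzki's condition.

Let $Y_{\cdot,\theta^n}(\xi^n)$ (resp. $Y_{\cdot,\theta}(\xi)$) be the solutions of the doubly reflected BSDEs associated with driver $f$,
barriers $(\eta_s)_{s < \theta^n}$ and $(\zeta_s)_{s < \theta^n}$ (resp. $(\eta_s)_{s < \theta}$ and $(\zeta_s)_{s < \theta}$), terminal time $\theta^n$ (resp. $\theta$), terminal
condition $\xi^n$ (resp. $\xi$). Then, for each stopping time $\tau$ with $\tau \le \theta$ a.s.,

$$Y_{\tau,\theta}(\xi) = \lim_{n \to +\infty} Y_{\tau,\theta^n}(\xi^n) \ \text{a.s.}$$

When for each $n$, $\theta^n = \theta$ a.s., the result still holds without Assumption (3.1).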

Remark 3.3. As in the case of reflected BSDEs (see Proposition A.6 in the Appendix), there is
an extra difficulty due to the presence of the barriers (and the variation of the terminal time). The
additional assumption (3.1) on the obstacle is here required to obtain the result.

Proof. We first consider the simpler case when for each $n$, $\theta^n = \theta$ a.s. By the a priori estimates
on doubly reflected BSDEs provided in [16] (see Proposition 5.3) and the convergence of $\xi^n$ to $\xi$,
we derive that $Y_{\tau,\theta}(\xi) = \lim_{n \to +\infty} Y_{\tau,\theta}(\xi^n)$ a.s.

We now turn to the general case. In this case, Proposition 5.3 in [16] does not give the result. By
the flow property for doubly reflected BSDEs (or “semigroup property”, see [10]), we have:

$$Y_{\tau,\theta^n}(\xi^n) = Y_{\tau,\theta}(Y_{\theta,\theta^n}(\xi^n)).$$

By the first step, it is thus sufficient to show that $\lim_{n \to +\infty} Y_{\theta,\theta^n}(\xi^n) = \xi$ a.s.

Since the solution of the doubly reflected BSDE associated with terminal condition $\xi^n$ and terminal
time $\theta^n$ is smaller than the solution $\bar Y_{\cdot,\theta^n}(\xi^n)$ of the reflected BSDE associated with (one) obstacle
$(\eta_t)_{t < \theta^n}$, terminal condition $\xi^n$ and terminal time $\theta^n$, we have:

$$Y_{\theta,\theta^n}(\xi^n) \le \bar Y_{\theta,\theta^n}(\xi^n) \ \text{a.s.} \tag{3.2}$$

By the continuity property of reflected BSDEs with respect to terminal time and terminal condition
(Lemma A.6), we have $\lim_{n \to \infty} \bar Y_{\theta,\theta^n}(\xi^n) = \bar Y_{\theta,\theta}(\xi) = \xi$ a.s. Taking the limsup in (3.2), we get

$$\limsup_{n \to \infty} Y_{\theta,\theta^n}(\xi^n) \le \lim_{n \to \infty} \bar Y_{\theta,\theta^n}(\xi^n) = \xi \ \text{a.s.}$$

It remains to show that

$$\liminf_{n \to \infty} Y_{\theta,\theta^n}(\xi^n) \ge \xi \ \text{a.s.} \tag{3.3}$$

Since the solution of the doubly reflected BSDE associated with terminal condition $\xi^n$ and terminal
time $\theta^n$ is greater than the solution $\tilde Y_{\cdot,\theta^n}(\xi^n)$ of the reflected BSDE associated with the upper
obstacle $(\zeta_t)_{t < \theta^n}$, terminal condition $\xi^n$ and terminal time $\theta^n$, we have:

$$Y_{\theta,\theta^n}(\xi^n) \ge \tilde Y_{\theta,\theta^n}(\xi^n) \ \text{a.s.} \tag{3.4}$$

By Lemma A.6, we derive that the solution of a reflected BSDE with upper barrier is continuous
with respect to both terminal time and terminal condition. Hence, $\lim_{n \to \infty} \tilde Y_{\theta,\theta^n}(\xi^n) = \xi$ a.s.
Taking the liminf in (3.4), we derive inequality (3.3). □

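Using Proposition 3.2 together with the monotonicity property of the solution of a doubly
reflected BSDE with respect to terminal condition, one can derive the following Fatou lemma.

Lemma 3.4 (A Fatou lemma for doubly reflected BSDEs). Let $T > 0$. Let $(\eta_t)$, $(\zeta_t)$ be two
RCLL processes in $\mathcal{S}^2$, with $\eta \le \zeta$ and satisfying Mokobodzki's condition. Let $f$ be a Lipschitz
driver satisfying Assumption 3.1. Let $(\theta^n)_{n \in \mathbb{N}}$ be a non increasing sequence of stopping times in $\mathcal{T}$,
converging a.s. to $\theta \in \mathcal{T}$ as $n$ tends to $\infty$. Let $(\xi^n)_{n \in \mathbb{N}}$ be a sequence of random variables such that
$E[\sup_n (\xi^n)^2] < +\infty$, and for each $n$, $\xi^n$ is $\mathcal{F}_{\theta^n}$-measurable. Suppose that for each $n$, the processes
$\eta_t \mathbf{1}_{\{t < \theta^n\}} + \xi^n \mathbf{1}_{\{t \ge \theta^n\}}$ and $\zeta_t \mathbf{1}_{\{t < \theta^n\}} + \xi^n \mathbf{1}_{\{t \ge \theta^n\}}$ satisfy Mokobodzki's condition.

Let $\underline{\xi} := \liminf_{n \to +\infty} \xi^n$. Suppose that the processes $\eta_t \mathbf{1}_{\{t < \theta\}} + \underline{\xi} \mathbf{1}_{\{t \ge \theta\}}$ and $\zeta_t \mathbf{1}_{\{t < \theta\}} + \underline{\xi} \mathbf{1}_{\{t \ge \theta\}}$
satisfy Mokobodzki's condition, and that

$$\eta_\theta \le \underline{\xi} \le \zeta_\theta \ \text{a.s.} \tag{3.5}$$

Let $Y_{\cdot,\theta^n}(\xi^n)$ (resp. $Y_{\cdot,\theta}(\underline{\xi})$) be the solution(s) of the doubly reflected BSDEs associated with
driver $f$, barriers $(\eta_s)_{s < \theta^n}$ and $(\zeta_s)_{s < \theta^n}$ (resp. $(\eta_s)_{s < \theta}$ and $(\zeta_s)_{s < \theta}$), terminal time $\theta^n$ (resp. $\theta$),
terminal condition $\xi^n$ (resp. $\underline{\xi}$).

Then, for each stopping time $\tau$ with $\tau \le \theta$ a.s.,

$$Y_{\tau,\theta}(\underline{\xi}) \le \liminf_{n \to +\infty} Y_{\tau,\theta^n}(\xi^n) \ \text{a.s.} \tag{3.6}$$

When for each $n$, $\theta^n = \theta$ a.s., the result still holds without Assumption (3.5).

Remark 3.5. The result also holds for $\bar\xi := \limsup_{n \to +\infty} \xi^n$ instead of $\underline{\xi}$, with the converse
inequality in (3.6) and liminf replaced by limsup, under the same assumptions with $\underline{\xi}$ replaced by $\bar\xi$.

The proof, which is very similar to that of the Fatou lemma for classical BSDEs (see Lemma A.5),
is left to the reader. This lemma will be used to obtain a weak dynamic principle.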

4 Dynamic programming principles 

We provide now two dynamic programming principles. When the terminal reward g is Borelian 
(resp. continuous), we show a weak (resp. strong) dynamic programming principle. We stress that 
in the continuous case, the strong dynamic principle cannot be directly deduced from the weak one. 

4.1 A weak dynamic programming principle in the irregular case

Measurability properties of the functions $u$ and $u^\alpha$. We first show some measurability
properties of the functions $u^\alpha(t,x)$ with respect to control $\alpha$ and initial condition $x$.

By the a priori estimates on doubly reflected BSDEs provided in [16] (see Proposition 5.3), we
derive the following measurability property of $u^\alpha$.

Lemma 4.1 (A measurability property of $u^\alpha$). Let $s \in [0,T]$.

The map $(\alpha, x) \mapsto u^\alpha(s,x)$; $(\mathcal{A}_s^s \times \mathbb{R},\ \mathcal{B}'(\mathcal{A}_s^s) \otimes \mathcal{B}(\mathbb{R})) \to (\mathbb{R}, \mathcal{B}(\mathbb{R}))$ is measurable. Here $\mathcal{B}'(\mathcal{A}_s^s)$
denotes the $\sigma$-algebra induced by $\mathcal{B}(\mathbb{H}_s^2)$ on $\mathcal{A}_s^s$.

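Proof. Let $x^1, x^2 \in \mathbb{R}$, and $\alpha^1, \alpha^2 \in \mathcal{A}_s^s$. By classical results on diffusion processes, we have

$$E\Big[\sup_{r \ge s} |X_r^{\alpha^1,s,x^1} - X_r^{\alpha^2,s,x^2}|^2\Big] \le C\big(\|\alpha^1 - \alpha^2\|_{\mathbb{H}_s^2}^2 + |x^1 - x^2|^2\big). \tag{4.1}$$

Let $Y_{s,T}^{\alpha,s,x}[\eta, \xi, \zeta]$ be the solution at time $s$ of the doubly reflected BSDE associated with driver
$f^{\alpha,s,x} := (f(\alpha_r, r, X_r^{\alpha,s,x}, \cdot)\mathbf{1}_{r \ge s})$, barriers $\xi_r \le \zeta_r$, $s \le r < T$, terminal condition $\eta$. We here
suppose that $\tilde\xi_r := \xi_r \mathbf{1}_{r < T} + \eta \mathbf{1}_{r = T}$ and $\tilde\zeta_r := \zeta_r \mathbf{1}_{r < T} + \eta \mathbf{1}_{r = T}$ satisfy Mokobodzki's condition (see
(2.7) or Definition 3.9 in [16]).

Using the Lipschitz property of $f$ with respect to $x$, $\alpha$, we have

$$\Big\| \sup_{y,z,k} |f(\alpha^1, X^{\alpha^1,s,x^1}, y, z, k) - f(\alpha^2, X^{\alpha^2,s,x^2}, y, z, k)| \Big\|_{\mathbb{H}_s^2} \le C\big(\|\alpha^1 - \alpha^2\|_{\mathbb{H}_s^2} + \|X^{\alpha^1,s,x^1} - X^{\alpha^2,s,x^2}\|_{\mathcal{S}_s^2}\big).$$

By the estimates on doubly reflected BSDEs with universal constants provided in [16] (see
Proposition 5.3), we obtain that for all $x^1, x^2 \in \mathbb{R}$, $\alpha^1, \alpha^2 \in \mathcal{A}_s^s$, barriers $\xi^1, \xi^2, \zeta^1, \zeta^2 \in \mathcal{S}_s^2$ and
terminal conditions $\eta^1, \eta^2 \in L_s^2$,

$$|Y_{s,T}^{\alpha^1,s,x^1}[\eta^1, \xi^1, \zeta^1] - Y_{s,T}^{\alpha^2,s,x^2}[\eta^2, \xi^2, \zeta^2]|^2 \le C\big(\|\alpha^1 - \alpha^2\|_{\mathbb{H}_s^2}^2 + |x^1 - x^2|^2 + \|\eta^1 - \eta^2\|_{L_s^2}^2 + \|\xi^1 - \xi^2\|_{\mathcal{S}_s^2}^2 + \|\zeta^1 - \zeta^2\|_{\mathcal{S}_s^2}^2\big), \tag{4.2}$$

where $C$ is a constant depending only on $T$ and the Lipschitz constant $C$ of the map $f$.

Let $\Phi : \mathcal{A}_s^s \times \mathbb{R} \times (\mathcal{S}_s^2)^2 \times L_s^2 \to \mathbb{R}$, $(\alpha, x, \xi, \zeta, \eta) \mapsto Y_{s,T}^{\alpha,s,x}[\eta, \xi, \zeta]$. By (4.2), the map $\Phi$ is
Lipschitz-continuous with respect to the norm $\|\cdot\|_{\mathbb{H}_s^2} + |\cdot| + \|\cdot\|_{\mathcal{S}_s^2} + \|\cdot\|_{\mathcal{S}_s^2} + \|\cdot\|_{L_s^2}$.
Now, by Proposition 3.6 in [17], the map $\xi \mapsto g(\xi)$, valued in $L_s^2$, is Borelian. Hence, the map
$(\alpha, x) \mapsto (\alpha, x, h_1(\cdot, X^{\alpha,s,x}), h_2(\cdot, X^{\alpha,s,x}), g(X_T^{\alpha,s,x}))$ defined on $\mathcal{A}_s^s \times \mathbb{R}$ and valued in $\mathcal{A}_s^s \times \mathbb{R} \times
(\mathcal{S}_s^2)^2 \times L_s^2$ is $\mathcal{B}'(\mathcal{A}_s^s) \otimes \mathcal{B}(\mathbb{R}) / \mathcal{B}'(\mathcal{A}_s^s) \otimes \mathcal{B}(\mathbb{R}) \otimes \mathcal{B}((\mathcal{S}_s^2)^2) \otimes \mathcal{B}(L_s^2)$-measurable.

By composition, it follows that the map $(\alpha, x) \mapsto Y_{s,T}^{\alpha,s,x}[g(X_T^{\alpha,s,x})] = u^\alpha(s,x)$ is Borelian. □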


Using the a priori estimates on doubly reflected BSDEs provided in [16] and standard arguments, 
we derive the following result. 

Lemma 4.2. For each $t \in [0,T]$, for each $\alpha \in \mathcal{A}_t^t$, the map $u^\alpha$ has at most polynomial growth with
respect to $x$. The property still holds for the value function $u$.


We introduce the upper semicontinuous envelope $u^*$ and the lower semicontinuous envelope $u_*$
of the value function $u$, defined by

$$u^*(t,x) := \limsup_{(t',x') \to (t,x)} u(t',x'); \quad u_*(t,x) := \liminf_{(t',x') \to (t,x)} u(t',x') \quad \forall (t,x) \in [0,T] \times \mathbb{R}.$$

We also define the maps $\bar u^*$ and $\bar u_*$ for each $(t,x) \in [0,T] \times \mathbb{R}$ by

$$\bar u^*(t, x) := u^*(t, x)\mathbf{1}_{t < T} + g(x)\mathbf{1}_{t = T}; \quad \bar u_*(t, x) := u_*(t, x)\mathbf{1}_{t < T} + g(x)\mathbf{1}_{t = T}.$$

The functions $\bar u^*$ and $\bar u_*$ are Borelian. We have $\bar u_* \le u \le \bar u^*$ and $\bar u_*(T, \cdot) = u(T, \cdot) = \bar u^*(T, \cdot) = g(\cdot)$.
Note that $\bar u^*$ (resp. $\bar u_*$) is not necessarily upper (resp. lower) semicontinuous on $[0,T] \times \mathbb{R}$, since
the terminal reward $g$ is only Borelian.

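For each $\theta \in \mathcal{T}$ and each $\xi$ in $L^2(\mathcal{F}_\theta)$, we denote by $(Y_{\cdot,\theta}^{\alpha,t,x}[\xi], Z_{\cdot,\theta}^{\alpha,t,x}[\xi], K_{\cdot,\theta}^{\alpha,t,x}[\xi])$ the unique
solution in $\mathcal{S}^2 \times \mathbb{H}^2 \times \mathbb{H}_\nu^2$ of the doubly reflected BSDE with driver $f^{\alpha,t,x}\mathbf{1}_{\{s \le \theta\}}$, terminal time $T$,
terminal condition $\xi$ and barriers $h_i(r, X_r^{\alpha,t,x})\mathbf{1}_{r < \theta} + \xi \mathbf{1}_{r \ge \theta}$, $i = 1, 2$.

We now state a weak dynamic programming principle for our mixed game problem, which can
be seen as the analogue of the one shown in [17] for a mixed optimal stopping/stochastic control
problem.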


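Theorem 4.3. [Weak dynamic programming principle] Suppose the subset A of $\mathbb{R}$ is compact. We
have the following sub-optimality principle of dynamic programming:
for each $t \in [0,T]$ and for each stopping time $\theta \in \mathcal{T}^t_t$,

$$u(t,x) \le \sup_{\alpha \in \mathcal{A}^t_t} Y^{\alpha,t,x}_{t,\theta}\big[\bar u^*(\theta, X^{\alpha,t,x}_{\theta})\big]. \qquad (4.3)$$

We also have the super-optimality principle of dynamic programming:
for each $t \in [0,T]$ and for each stopping time $\theta \in \mathcal{T}^t_t$,

$$u(t,x) \ge \sup_{\alpha \in \mathcal{A}^t_t} Y^{\alpha,t,x}_{t,\theta}\big[\bar u_*(\theta, X^{\alpha,t,x}_{\theta})\big]. \qquad (4.4)$$

These results still hold with $\theta$ replaced by $\theta^{\alpha}$ in inequalities (4.3) and (4.4), given a family of
stopping times indexed by controls $\{\theta^{\alpha}, \alpha \in \mathcal{A}^t_t\}$.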

Proof. Without loss of generality, we can suppose that $t = 0$. Let us show inequality (4.3). For
each $(t,x) \in [0,T] \times \mathbb{R}$, we set $\bar h_i(t,x) := h_i(t,x)\mathbf{1}_{t<T} + g(x)\mathbf{1}_{t=T}$. Let $\theta \in \mathcal{T}$. For each $n \in \mathbb{N}$, we
define 9^^ := + T1o=t, where 4 := and Ak := < 9 < Note that 

9^ gT and I 9. 
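To make the discretization of $\theta$ concrete, here is a small worked instance. It assumes the standard dyadic construction $t_k = (k+1)T/2^n$ and $A_k = \{kT/2^n \le \theta < (k+1)T/2^n\}$; this is an assumption about the intended formula, and the sample value $\theta(\omega) = 0.3$ is purely hypothetical.

```latex
% Assumed dyadic scheme (an assumption, not confirmed by the text):
%   theta^n = sum_k t_k 1_{A_k} + T 1_{theta = T},
%   t_k = (k+1) T / 2^n,  A_k = { k T / 2^n <= theta < (k+1) T / 2^n }.
% Hypothetical sample point: T = 1, theta(omega) = 0.3.
\begin{align*}
n=1:&\quad 0 \le 0.3 < \tfrac{1}{2}            &&\Rightarrow\ \theta^1(\omega) = 0.5,\\
n=2:&\quad \tfrac{1}{4} \le 0.3 < \tfrac{2}{4}  &&\Rightarrow\ \theta^2(\omega) = 0.5,\\
n=3:&\quad \tfrac{2}{8} \le 0.3 < \tfrac{3}{8}  &&\Rightarrow\ \theta^3(\omega) = 0.375,\\
n=4:&\quad \tfrac{4}{16} \le 0.3 < \tfrac{5}{16} &&\Rightarrow\ \theta^4(\omega) = 0.3125.
\end{align*}
% Under this scheme, on {theta < T}: theta < theta^n <= theta + T/2^n,
% and the sequence (theta^n) is non-increasing in n.
```

Each $\theta^n$ takes finitely many values, which is what allows the splitting property to be applied on each set $A_k$ separately.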

Let n G N. Let a G .4. By the flow property for doubly reflected BSDEs, we get = 

^TT. ^ 9 ^ ^ 

T' ]" each s G [0,r], for each a; G O, set ^uj := {u}rAs)o<r<T- Note that for each 

a; G n, we have a; = In the sequel, we identify uj with 

By the splitting property for doubly reflected BSDEs (see Proposition A. 2), there exists a P-null 
set M such that for each k and for each u; G n we have 




a, O’ 


I ya.,Q,x 




(^) = 


. -v^a.O.a; 

ikT 


= Y^ 


adkuj,-),ti,,Xf’°’'^dkuj) 
tkT 






where the last equality follows from the definition of Now, by definition of u, we 

+ r\ r\ ^ 0 ^ 

have X]] ’^(*'=u;)) < u{tk,X[[f ’^(^''w)). Since u < u*, we derive that Y^f j,' < 


u*(9^,Xg^ ’^) a.s. Hence, using the comparison theorem for doubly reflected BSDEs, we obtain 
0 ,0 a _ Ef ] < V°^[^’^lu*(9"', X^^^’^)]. By taking the limsup, we thus get 


y; 


0,T 


0 , 0 " 


t/-Q:,0,X ^ -I* t/-Q;, 0,1E r —5(: //ITT, tt-CK.O.Xnt 

Y^y < lim sup Y^y [u {0 , )J. 


(4.5) 










Let us now show that the assumptions of the Fatou lemma for doubly reflected BSDEs (see
Lemma 3.4 and Remark 3.5) are satisfied. Recall that $u \le h_2$ on $[0,T[ \times \mathbb{R}$. Since $h_2$ is continuous,
we get $u^* \le h_2$ on $[0,T[ \times \mathbb{R}$. On $\{\theta < T\}$, we thus have

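$$\limsup_{n \to \infty} \bar u^*(\theta^n, X^{\alpha,0,x}_{\theta^n}) \le \limsup_{n \to \infty} h_2(\theta^n, X^{\alpha,0,x}_{\theta^n}) = h_2(\theta, X^{\alpha,0,x}_{\theta}) \quad \text{a.s.},$$

where the last equality follows from the continuity property of $h_2$ on $[0,T[ \times \mathbb{R}$.
Now, on $\{\theta = T\}$, $\theta^n = T$. Hence, we have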




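We thus get $\limsup_{n \to +\infty} \bar u^*(\theta^n, X^{\alpha,0,x}_{\theta^n}) \le h_2(\theta, X^{\alpha,0,x}_{\theta})$ a.s.

Similarly, one can show that $\limsup_{n \to +\infty} \bar u^*(\theta^n, X^{\alpha,0,x}_{\theta^n}) \ge h_1(\theta, X^{\alpha,0,x}_{\theta})$ a.s. We thus have

$$h_1(\theta, X^{\alpha,0,x}_{\theta}) \le \limsup_{n \to +\infty} \bar u^*(\theta^n, X^{\alpha,0,x}_{\theta^n}) \le h_2(\theta, X^{\alpha,0,x}_{\theta}) \quad \text{a.s.}$$
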

Condition (3.5) (with limsup instead of liminf) is thus satisfied with = u*(0",and 

gt = hi{t, X"’”’"^) and 0 = h 2 (L ^ 

Since hi is C^’^, by Proposition A.l, for each n, /ii(s, X“’*’^)l|j,< 5 in}. + and 

/i 2 (s,Xf’*’^)l{ 5 <- 0 n}. +C"'l{s=e"} satisfy Mokobodzki’s condition. 

Moreover, setting ^ := limsup„_j,_,_oou*(0”,X^^*^’^), the processes /ii(s,X“’*’^)l{^<e} + 
and /i 2 (s,Xf’*’^)l|g< 5 i}. + also satisfy Mokobodzki ’s condition. We can thus apply the 

Fatou lemma for doubly reflected BSDEs (Lemma 3.4). Using (4.5), we then get: 


-'0,T 


< lim sup Y^fn'^[u*(9'^,Xg^°’'^)] < yQ"^°’'^[lim sup u*{9'' ,y^g 


X^’")]. 


Using the upper semicontinuity property of u* on [0,T[xM and u*{T,x) = g{x), we obtain 


■^a,0,x 

-'O.T 


ra,0,x 
— -‘ 0,0 


< [lim sup u*(9 


n -v^a,0,fc\ 

, A, 


)]<rf’"[u*(0,xr’")], 


0n 


0,0,Xx 


and this holds for each $\alpha \in \mathcal{A}$. Taking the supremum over $\alpha \in \mathcal{A}$, we get (4.3).

The proof of inequality (4.4) relies on arguments similar to those above, as well as on an existence result for
$\varepsilon$-optimal controls satisfying appropriate measurability properties (see Proposition A.3). For more
details on this last argument, we refer to the proof of the weak DPP for a mixed optimal
stopping/control problem with $\mathcal{E}^f$-expectations (Th. 3.9 in [17]). □


4.2 A strong dynamic programming principle in the continuous case

In this section, the set A, where the controls are valued, is a nonempty closed subset of $\mathbb{R}^p$.

We suppose here that Assumption 4.4 holds. 

Assumption 4.4. The terminal reward map $g$ is Lipschitz-continuous and the barrier maps $h_1$,
$h_2$ are Lipschitz-continuous with respect to $x$ uniformly in $t$. Moreover, we have

$$h_1(T,x) \le g(x) \le h_2(T,x), \qquad \forall x \in \mathbb{R}. \qquad (4.6)$$
Under this assumption, we show that $u$ is continuous with respect to $x$, and that $u^{\alpha}$ is continuous
with respect to $(\alpha,x)$.

Lemma 4.5 (A continuity property of $u^{\alpha}$ and $u$). Suppose Assumption 4.4 holds. Then for each
$s \in [0,T]$, the map $(\alpha,x) \mapsto u^{\alpha}(s,x)$ is continuous and the value function $x \mapsto u(s,x)$ is continuous.

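Proof. By estimates (4.1) and (4.2), we derive that

$$|u^{\alpha^1}(s,x^1) - u^{\alpha^2}(s,x^2)|^2 = |Y^{\alpha^1,s,x^1}_{s,T} - Y^{\alpha^2,s,x^2}_{s,T}|^2 \le C'\big(\|\alpha^1 - \alpha^2\|^2_{\mathbb{H}^2} + |x^1 - x^2|^2\big),$$

where $C'$ is a constant depending only on $T$ and the Lipschitz constant $C$ of the map $f$. It follows
that the map $(\alpha,x) \mapsto u^{\alpha}(s,x)$ is Lipschitz-continuous.

Hence, since $u(s,x) = \sup_{\alpha \in \mathcal{A}^s_s} u^{\alpha}(s,x)$, the map $x \mapsto u(s,x)$ is Lipschitz-continuous. □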
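The last step of this proof, passing a uniform Lipschitz bound through a supremum, can be spelled out as follows. This is a sketch of the standard argument; it uses the estimate above with $\alpha^1 = \alpha^2 = \alpha$ and the finiteness of the suprema, which follows from the polynomial growth stated in Lemma 4.2.

```latex
% Elementary fact: for two real families with finite suprema,
%   | sup_a p_a - sup_a q_a | <= sup_a | p_a - q_a |.
% Apply it with p_a = u^a(s, x^1) and q_a = u^a(s, x^2):
\begin{align*}
|u(s,x^1) - u(s,x^2)|
  &= \Big| \sup_{\alpha} u^{\alpha}(s,x^1) - \sup_{\alpha} u^{\alpha}(s,x^2) \Big| \\
  &\le \sup_{\alpha} \big| u^{\alpha}(s,x^1) - u^{\alpha}(s,x^2) \big| \\
  &\le \sqrt{C'}\, |x^1 - x^2| .
\end{align*}
% The constant sqrt(C') does not depend on alpha or on s, so the bound is
% uniform in the control and in time.
```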


In order to show the strong dynamic programming principle, we first prove that the value 
function u is continuous with respect to {t,x). We have already shown that u is continuous with 
respect to x uniformly in t (see Lemma 4.5). It is thus sufficient to show the continuity of u 
with respect to t. To this purpose, we first prove a strong dynamic programming principle at 
deterministic times. 


Lemma 4.6. Suppose that $g$, $h_1$ and $h_2$ satisfy the continuity Assumption 4.4. Let $t \in [0,T]$.

For all $s \ge t$, the value function $u$ defined by (2.5) satisfies the equality

$$u(t,x) = \sup_{\alpha \in \mathcal{A}^t_t} Y^{\alpha,t,x}_{t,s}\big[u(s, X^{\alpha,t,x}_s)\big]. \qquad (4.7)$$

Proof. We first show that:

$$u(t,x) \le \sup_{\alpha \in \mathcal{A}^t_t} Y^{\alpha,t,x}_{t,s}\big[u(s, X^{\alpha,t,x}_s)\big]. \qquad (4.8)$$


By the flow property for doubly reflected BSDEs (see [10]), we have that:

$$Y^{\alpha,t,x}_{t,T} = Y^{\alpha,t,x}_{t,s}\big[Y^{\alpha,t,x}_{s,T}\big].$$


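Note that for almost every $\omega$, at fixed ${}^s\omega$, the process $\alpha({}^s\omega, T^s)$ (denoted also by $\alpha({}^s\omega, \cdot)$) belongs
to $\mathcal{A}^s_s$ (see Section A.2 in the Appendix). Moreover, by Proposition A.2, for almost every $\omega$, at
fixed ${}^s\omega$, $Y^{\alpha,t,x}_{s,T}({}^s\omega)$ coincides with the solution at time $s$ of the doubly reflected
BSDE associated with control $\alpha({}^s\omega, \cdot)$, initial conditions $s$, $X^{\alpha,t,x}_s({}^s\omega)$, with respect to the filtration $\mathbb{F}^s$
and driven by the $s$-translated Brownian motion and $s$-translated Poisson measure. Now, by
using the definition of $u^{\alpha}$, we get that for almost every $\omega$,

$$Y^{\alpha,t,x}_{s,T}({}^s\omega) = u^{\alpha({}^s\omega,\cdot)}\big(s, X^{\alpha,t,x}_s({}^s\omega)\big) \le u\big(s, X^{\alpha,t,x}_s({}^s\omega)\big).$$

Finally, the comparison theorem for doubly reflected BSDEs (see Th. 5.1 in [16]) leads to:

$$u^{\alpha}(t,x) = Y^{\alpha,t,x}_{t,s}\big[Y^{\alpha,t,x}_{s,T}\big] \le Y^{\alpha,t,x}_{t,s}\big[u(s, X^{\alpha,t,x}_s)\big].$$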
Taking the supremum over $\alpha \in \mathcal{A}^t_t$ in this inequality, we get inequality (4.8).

It remains to show the inequality:

$$\sup_{\alpha \in \mathcal{A}^t_t} Y^{\alpha,t,x}_{t,s}\big[u(s, X^{\alpha,t,x}_s)\big] \le u(t,x). \qquad (4.9)$$


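Fix $s \in [t,T]$ and $\alpha \in \mathcal{A}^t_t$. By Corollary A.4, there exists an "optimizing" sequence of measurable
controls for $u(s, X^{\alpha,t,x}_s)$, satisfying appropriate measurability properties. More precisely, there exists
a sequence $(\alpha^n)_{n \in \mathbb{N}}$ of controls belonging to $\mathcal{A}^t_s$ such that, for P-almost every $\omega$, we have

$$u\big(s, X^{\alpha,t,x}_s({}^s\omega)\big) = \lim_{n \to \infty} u^{\alpha^n({}^s\omega,\cdot)}\big(s, X^{\alpha,t,x}_s({}^s\omega)\big) = \lim_{n \to \infty} Y^{\alpha^n({}^s\omega,\cdot),\, s,\, X^{\alpha,t,x}_s({}^s\omega)}_{s,T}. \qquad (4.10)$$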


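For each $n \in \mathbb{N}$, we set:

$$\tilde\alpha^n_u := \alpha_u \mathbf{1}_{u<s} + \alpha^n_u \mathbf{1}_{s \le u \le T}.$$

Note that $\tilde\alpha^n \in \mathcal{A}^t_t$. By the splitting property for doubly reflected BSDEs (see Proposition A.2),
for P-almost every $\omega$, we have

$$Y^{\alpha^n({}^s\omega,\cdot),\, s,\, X^{\alpha,t,x}_s({}^s\omega)}_{s,T} = Y^{\tilde\alpha^n,t,x}_{s,T}({}^s\omega).$$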


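Hence, by (4.10), applying the continuity property of doubly reflected BSDEs with respect to
terminal condition (see Proposition 3.2), we obtain

$$Y^{\alpha,t,x}_{t,s}\big[u(s, X^{\alpha,t,x}_s)\big] = Y^{\alpha,t,x}_{t,s}\big[\lim_{n \to \infty} Y^{\tilde\alpha^n,t,x}_{s,T}\big] = \lim_{n \to \infty} Y^{\alpha,t,x}_{t,s}\big[Y^{\tilde\alpha^n,t,x}_{s,T}\big].$$

Now, by the flow property of doubly reflected BSDEs, for each $n$, we have

$$Y^{\tilde\alpha^n,t,x}_{t,T} = Y^{\alpha,t,x}_{t,s}\big[Y^{\tilde\alpha^n,t,x}_{s,T}\big]. \qquad (4.11)$$

We thus get

$$Y^{\alpha,t,x}_{t,s}\big[u(s, X^{\alpha,t,x}_s)\big] = \lim_{n \to \infty} Y^{\tilde\alpha^n,t,x}_{t,T} \le u(t,x).$$

Taking the supremum over $\alpha \in \mathcal{A}^t_t$ in this inequality, we get (4.9), which ends the proof.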


□ 


Using this strong dynamic programming principle at deterministic times, we derive the conti¬ 
nuity property of u with respect to time t. 

Theorem 4.7. Suppose that $g$, $h_1$ and $h_2$ satisfy the continuity Assumption 4.4. The value function
$u$ is then continuous with respect to $t$, uniformly in $x$.

Proof. Since (4.6) holds, we have $h_1(t,x) \le u(t,x) \le h_2(t,x)$ for all $(t,x) \in [0,T] \times \mathbb{R}$.

Let 0 < t < s < T. We have 


u{t,x) - n(s,a:)| < \u{t,x) - sup [u(s,Af’*’*)]| -h | sup S^f’''[u{s,X^’*’^)] - u{s,x)\ 


^£A\ 


G4‘ 






We start by estimating | supQ,g_ 4 t £’“/’^[tt(s, A 

-n(s,a;)| < sup |T“/’"[u(s, Af’*’")] - u(s,x)| 
qG. 4 * 

(A"’*’^ - xf]^ < C|s - t|(l + x2P)i (4.12) 

S 


I sup 

“G.A* 

< sup \£^f^^'^[u{s,xf’^’^)] -£:j°Ju(s,x)]| < CEf sup 

a&A\ t<r<s 
Here, $\mathcal{E}^{0}$ denotes the conditional expectation associated with the driver equal to 0. In order to
obtain the above relation, we have used BSDE estimates (see [31]), the Lipschitz property of $u$
with respect to $x$ (see Lemma 4.5) and the polynomial growth of $u$ (see Lemma 4.2).

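We now estimate $\big|u(t,x) - \sup_{\alpha \in \mathcal{A}^t_t} \mathcal{E}^{\alpha,t,x}_{t,s}[u(s, X^{\alpha,t,x}_s)]\big|$. Using the strong dynamic programming
principle for deterministic times (see Lemma 4.6), we derive that:

$$u(t,x) = \sup_{\alpha \in \mathcal{A}^t_t} Y^{\alpha,t,x}_{t,s}\big[u(s, X^{\alpha,t,x}_s)\big].$$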


Now, the solution Y of the doubly reflected BSDE is smaller than the solution of the reflected 
BSDE with the same lower barrier, denoted by Y. Hence, 

[u{s, Xf’^n] < YtT + u{s, Xf’*’")W=,] . 

By the characterization of the solution of a reflected BSDE, we derive that 

i{t,x)-sup£tf[u{s,Xf^^-)] < sup sup(^“;‘A:[/^i(uX“’*’")W<, + u(s,Xf’*’")W>,] 


^&Al 


< A, 


(4.13) 


where 


A:= sup sup lhl(T, X^’*’^)lr<s + U(S, X^’*’^)lr>s] - lhl(s, X^’^’^)lr<s + U(S, X^’^’^}lr>s]l 


aeAf t£T^ 


with /“(s, •) := /“’*’^(s, because u > hi. By the Lipschitz property in x of hi, the polynomial 

growth of hi and / in x, and the standard estimates for BSDEs and SDEs, we have 

< C sup sup E[(hi(r,X“’‘’*)l^<s - hi(s,Xf’‘’*)l^<s)^]+E[ / /^(ar, r, hi(T, X"’*’""), 0,0)dr] 

aeAlreT^ Jr As 

< C'[ sup sup |hi(r, x) — hi(s, x)p + (1 + |x|^)^|s — t| + |s — t|(l + |x|'^)]. (4-14) 

t<r<s X 

Also, the solution Qf doubly reflected BSDE is greater than the solution of the reflected 

BSDE with the same upper barrier, denoted by y“>h3:_ fjence, 

y“/’" [u(s, Xf 4,-)] > y“/-- X^’^’-)U<r<s + u{s, X^’^’-)lr=s] • 

By the characterization of the solution of a reflected BSDE with upper barrier , we derive that 

Y,f’^ [h2{r,XA^’-)lt<r<s+n{s,Xf’^’-)lr=s] = inf f“4;f,[h2(a,X“4--)i^<, +ii(s, 


GT/ 


It follows that 


u 


it,x)- sup£^f^[uis,Xf’^n] > sup inf (^“ 4 ^,[h 2 (u,X“ 4 --)i^<,+ !!(,, 


^€At 


^eAl 


-^“/’"[u(s, Xf+ u(s, Xf’‘’")w,]) 


> B, 


(4.15) 
where


B : = 




,g^t <Ter; 


with /“(s, •) := because u < h 2 . Now, by the Lipschitz property in x of / 12 , the 

polynomial growth of /i 2 and / in x, we have 


< C sup inf E[(h 2 (u,X“’‘’^)l^<,-h 2 (s,Xf’‘’^)l 


ig^t (xer/ 


,<,)2]+E[r , 

J O'As 


f\ar,r, h 2 ia, 0)dr] 


< C'[ sup sup |/i 2 (r,x) —/i 2 (s,x)p + (1 + |x|^)^|s — t| + |s — t|(l + Ixl*^)]. (4-16) 

t<r<s X 


By (4.13) and (4.15), we get 

\u{t,x) - sup ^[u(s,Xf4-*)]p < + B'^. 

a&A\ 

This, together with inequalities (4.14), (4.16) and inequality (4.12) implies that 

!«(!, x) — tt(s, x)p < C'[ sup sup(|/ii(r, x) —/ii(s, x)p + sup sup(|h 2 (r, x) —/i 2 (s, x)p] 

t<r<s X t<r<s X 

+ C[(s - tf + |s - t|((l + IxH' + 1 + \xn- 

Now, since $h_1$ and $h_2$ are continuous with respect to $t$ on $[0,T]$ uniformly in $x$, we derive that $u$ is
continuous with respect to time $t$, uniformly in $x$. The proof is thus ended. □

From this theorem, we derive the continuity property of u with respect to (t,x). 

Corollary 4.8. Suppose that $g$, $h_1$ and $h_2$ satisfy the continuity Assumption 4.4.
The value function $u$ is then continuous on $[0,T] \times \mathbb{R}$.

Proof. Since Assumption 4.4 is satisfied, $u$ is continuous with respect to $x$ (see Lemma 4.5). This
property together with the previous theorem implies that $u$ is continuous with respect to $(t,x)$. □
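The way the two one-variable continuity properties combine into joint continuity can be written out explicitly. The sketch below only uses Theorem 4.7 (continuity in $t$, uniform in $x$) and the continuity of $x \mapsto u(t,x)$ at fixed $t$; the point $(t',x')$ is an arbitrary nearby point.

```latex
% Fix (t,x) and let (t',x') -> (t,x). Split the increment at (t,x'):
\[
|u(t',x') - u(t,x)|
  \le \underbrace{|u(t',x') - u(t,x')|}_{\le\, \sup_{y} |u(t',y) - u(t,y)|}
    + \underbrace{|u(t,x') - u(t,x)|}_{\text{continuity in } x \text{ at fixed } t} .
\]
% The first term tends to 0 as t' -> t because the continuity in t is uniform
% in the space variable; the second tends to 0 as x' -> x.
```

Uniformity in $x$ is what makes the first term controllable even though $x'$ moves together with $t'$.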


This result yields that under Assumption 4.4, we have $u^* = u_* = u$. Hence, by the weak
dynamic programming principle (which still holds even if the set A is not compact because of
the continuity Assumption 4.4) it follows that the value function $u$ satisfies the following strong
dynamic programming principle at stopping times.

Theorem 4.9. [Strong dynamic programming principle] Suppose that $g$, $h_1$ and $h_2$ satisfy the
continuity Assumption 4.4. For each $t \in [0,T]$ and for each stopping time $\theta \in \mathcal{T}^t_t$, we have

$$u(t,x) = \sup_{\alpha \in \mathcal{A}^t_t} Y^{\alpha,t,x}_{t,\theta}\big[u(\theta, X^{\alpha,t,x}_{\theta})\big]. \qquad (4.17)$$

This result still holds with $\theta$ replaced by $\theta^{\alpha}$, given a family of stopping times indexed by controls
$\{\theta^{\alpha}, \alpha \in \mathcal{A}^t_t\}$.
5 Generalized HJB variational inequalities

In this section, we do not suppose that Assumption 4.4 holds. 

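We introduce the following Hamilton-Jacobi-Bellman variational inequalities (HJBVIs):

$$\begin{cases} h_1(t,x) \le u(t,x) \le h_2(t,x), \quad t < T, \\ \text{if } u(t,x) < h_2(t,x) \text{ then } \inf_{\alpha \in A} H^{\alpha} u(t,x) \ge 0, \\ \text{if } h_1(t,x) < u(t,x) \text{ then } \inf_{\alpha \in A} H^{\alpha} u(t,x) \le 0, \end{cases} \qquad (5.1)$$

with terminal condition $u(T,x) = g(x)$, $x \in \mathbb{R}$. Here, $L^{\alpha} := A^{\alpha} + K^{\alpha}$ and for $\phi \in C^2(\mathbb{R})$,

• $A^{\alpha}\phi(x) := \frac{1}{2}\sigma^2(x,\alpha)\frac{\partial^2\phi}{\partial x^2}(x) + b(x,\alpha)\frac{\partial\phi}{\partial x}(x)$

• $K^{\alpha}\phi(x) := \int_E \Big(\phi(x + \beta(x,\alpha,e)) - \phi(x) - \frac{\partial\phi}{\partial x}(x)\beta(x,\alpha,e)\Big)\nu(de)$

• $B^{\alpha}\phi(x) := \phi(x + \beta(x,\alpha,\cdot)) - \phi(x)$.

• $H^{\alpha}u(t,x) := -\frac{\partial u}{\partial t}(t,x) - L^{\alpha}u(t,x) - f\big(\alpha,t,x,u(t,x),(\sigma\frac{\partial u}{\partial x})(t,x),B^{\alpha}u(t,x)\big)$.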
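To see how the three operators $A^{\alpha}$, $K^{\alpha}$ and $B^{\alpha}$ act, it can help to evaluate them on a simple test function. The choice $\phi(x) = x^2$ below is a hypothetical illustration (it is not used in the paper), and the computation assumes that $\beta(x,\alpha,\cdot)$ is square-integrable with respect to $\nu$ so that the integral is finite.

```latex
% Hypothetical test function: phi(x) = x^2, so phi'(x) = 2x and phi''(x) = 2.
\begin{align*}
A^{\alpha}\phi(x) &= \tfrac{1}{2}\sigma^2(x,\alpha)\cdot 2 + b(x,\alpha)\cdot 2x
                   = \sigma^2(x,\alpha) + 2x\, b(x,\alpha), \\
K^{\alpha}\phi(x) &= \int_E \big( (x+\beta)^2 - x^2 - 2x\beta \big)\, \nu(de)
                   = \int_E \beta(x,\alpha,e)^2\, \nu(de), \\
B^{\alpha}\phi(x) &= (x+\beta(x,\alpha,\cdot))^2 - x^2
                   = 2x\,\beta(x,\alpha,\cdot) + \beta(x,\alpha,\cdot)^2 .
\end{align*}
% K^alpha removes the first-order (compensated) part of the jump, leaving only
% the quadratic term, whereas B^alpha keeps the full increment as a function of e.
```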

Definition 5.1. • An upper semicontinuous function $u$ is said to be a viscosity subsolution of (5.1)
if for any point $(t_0,x_0) \in [0,T[ \times \mathbb{R}$ and for any $\phi \in C^{1,2}([0,T] \times \mathbb{R})$ such that $\phi(t_0,x_0) = u(t_0,x_0)$
and $\phi - u$ attains its minimum at $(t_0,x_0)$, if $h_1(t_0,x_0) < u(t_0,x_0)$ then $\inf_{\alpha \in A} H^{\alpha}\phi(t_0,x_0) \le 0$.

• A lower semicontinuous function $u$ is said to be a viscosity supersolution of (5.1) if for any
point $(t_0,x_0) \in [0,T[ \times \mathbb{R}$ and for any $\phi \in C^{1,2}([0,T] \times \mathbb{R})$ such that $\phi(t_0,x_0) = u(t_0,x_0)$ and $\phi - u$
attains its maximum at $(t_0,x_0)$, if $u(t_0,x_0) < h_2(t_0,x_0)$ then $\inf_{\alpha \in A} H^{\alpha}\phi(t_0,x_0) \ge 0$.

5.1 The irregular case 

Using the weak dynamic programming principle (Theorem 4.3), we now prove that the value function
of our problem is a viscosity solution of the above HJBVI in a weak sense.

Theorem 5.2. Suppose that A is compact. The map $u$ is a weak viscosity solution of (5.1) in the
sense that $u^*$ is a viscosity subsolution of (5.1) and $u_*$ is a viscosity supersolution of (5.1).

Remark 5.3. Using this theorem, when $h_1(T,x) \le g(x) \le h_2(T,x)$ for all $x \in \mathbb{R}$, we show in
the next section that $u_*$ is a viscosity supersolution of (5.1) with terminal value greater than $g_*$
(see Corollary 6.2). Moreover, when $g$ is l.s.c., we show that the value function $u$ is the minimal
viscosity supersolution of (5.1) with terminal value greater than $g$ (see Theorem 6.5).

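Proof. We first prove that $u_*$ is a viscosity supersolution of (5.1).

Let $(t_0,x_0) \in [0,T[ \times \mathbb{R}$ and $\phi \in C^{1,2}([0,T] \times \mathbb{R})$ be such that $\phi(t_0,x_0) = u_*(t_0,x_0)$ and
$\phi(t,x) \le u_*(t,x)$, for each $(t,x) \in [0,T] \times \mathbb{R}$. Without loss of generality, we can suppose that the
maximum is strict in $(t_0,x_0)$.

Let us assume that $u_*(t_0,x_0) < h_2(t_0,x_0)$ and that

$$\inf_{\alpha \in A}\Big(-\frac{\partial\phi}{\partial t}(t_0,x_0) - L^{\alpha}\phi(t_0,x_0) - f\big(\alpha,t_0,x_0,\phi(t_0,x_0),(\sigma\frac{\partial\phi}{\partial x})(t_0,x_0),B^{\alpha}\phi(t_0,x_0)\big)\Big) < 0.$$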
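By continuity, we can suppose that there exist $\alpha \in A$, $\varepsilon > 0$ and $\eta_\varepsilon > 0$ such that:

$\forall (t,x)$ such that $t_0 \le t \le t_0 + \eta_\varepsilon < T$ and $|x - x_0| \le \eta_\varepsilon$, we have $\phi(t,x) \le h_2(t,x) - \varepsilon$ and

$$-\frac{\partial\phi}{\partial t}(t,x) - L^{\alpha}\phi(t,x) - f\big(\alpha,t,x,\phi(t,x),(\sigma\frac{\partial\phi}{\partial x})(t,x),B^{\alpha}\phi(t,x)\big) \le -\varepsilon. \qquad (5.2)$$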


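Let $B_{\eta_\varepsilon}(t_0,x_0)$ be the ball of radius $\eta_\varepsilon$ and center $(t_0,x_0)$. Let $(t_n,x_n)_{n \in \mathbb{N}}$ be a sequence in
$B_{\eta_\varepsilon}(t_0,x_0)$ with $t_n \ge t_0$ for each $n$, such that the sequence $(t_n,x_n,u(t_n,x_n))_{n \in \mathbb{N}}$ tends to $(t_0,x_0,u_*(t_0,x_0))$.
We introduce the state process $X^{\alpha,t_n,x_n}$ associated with the above constant control $\alpha$.

Let $\theta^{\alpha,n}$ be the stopping time defined by

$$\theta^{\alpha,n} := (t_0 + \eta_\varepsilon) \wedge \inf\{s \ge t_n,\ |X^{\alpha,t_n,x_n}_s - x_0| \ge \eta_\varepsilon\}.$$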


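By applying Itô's lemma to $\phi(s, X^{\alpha,t_n,x_n}_s)$, one can derive that

$$\Big(\phi(s, X^{\alpha,t_n,x_n}_s),\ (\sigma\frac{\partial\phi}{\partial x})(s, X^{\alpha,t_n,x_n}_s),\ B^{\alpha}\phi(s, X^{\alpha,t_n,x_n}_s);\ s \in [t_n, \theta^{\alpha,n}]\Big) \qquad (5.3)$$

is the solution of the BSDE associated with terminal time $\theta^{\alpha,n}$, terminal value $\phi(\theta^{\alpha,n}, X^{\alpha,t_n,x_n}_{\theta^{\alpha,n}})$ and
driver $-\psi^{\alpha}(s, X^{\alpha,t_n,x_n}_s)$, where $\psi^{\alpha}(s,x) := \frac{\partial\phi}{\partial s}(s,x) + L^{\alpha}\phi(s,x)$. By the definition of the stopping
time $\theta^{\alpha,n}$ together with inequality (5.2), we obtain:

$$-\psi^{\alpha}(s, X^{\alpha,t_n,x_n}_s) \le f\big(\alpha, s, X^{\alpha,t_n,x_n}_s, \phi(s, X^{\alpha,t_n,x_n}_s), (\sigma\frac{\partial\phi}{\partial x})(s, X^{\alpha,t_n,x_n}_s), B^{\alpha}\phi(s, X^{\alpha,t_n,x_n}_s)\big) - \varepsilon, \qquad (5.4)$$

for $t_n \le s \le \theta^{\alpha,n}$, $ds \otimes dP$-a.s. Now, since the maximum $(t_0,x_0)$ is strict, there exists $\gamma_\varepsilon$, which
depends on $\eta_\varepsilon$, such that

$$u_*(t,x) \ge \phi(t,x) + \gamma_\varepsilon \quad \text{on } [0,T] \times \mathbb{R} \setminus B_{\eta_\varepsilon}(t_0,x_0). \qquad (5.5)$$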


Note now that 


(/.(r-”At, + -/>(r’”, tn<t<T. 

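Using the inequalities $\phi(t,x) \le h_2(t,x) - \varepsilon$ and (5.5) together with the definition of $\theta^{\alpha,n}$, we derive
that for each $t \in [t_n, \theta^{\alpha,n}]$,

$$\phi(t, X^{\alpha,t_n,x_n}_t) \le \big(h_2(t, X^{\alpha,t_n,x_n}_t) - \delta_\varepsilon\big)\mathbf{1}_{t<\theta^{\alpha,n}} + \big(u_*(\theta^{\alpha,n}, X^{\alpha,t_n,x_n}_{\theta^{\alpha,n}}) - \delta_\varepsilon\big)\mathbf{1}_{t=\theta^{\alpha,n}} \quad \text{a.s.},$$

where $\delta_\varepsilon := \min(\varepsilon, \gamma_\varepsilon)$. Hence, by the inequality (5.4) between the driver process $-\psi^{\alpha}(s, X^{\alpha,t_n,x_n}_s)$
and the driver $f(\alpha, s, X^{\alpha,t_n,x_n}_s, \cdot)$ computed along the solution (5.3), applying a comparison theorem
between a BSDE and a reflected BSDE (see Proposition A.11 in [17]), we obtain

$$\phi(t_n,x_n) \le \tilde Y^{\alpha,t_n,x_n}_{t_n,\theta^{\alpha,n}}\big[h_2(t, X^{\alpha,t_n,x_n}_t)\mathbf{1}_{t<\theta^{\alpha,n}} + u_*(\theta^{\alpha,n}, X^{\alpha,t_n,x_n}_{\theta^{\alpha,n}})\mathbf{1}_{t=\theta^{\alpha,n}}\big] - \delta_\varepsilon K, \qquad (5.6)$$

where $\tilde Y$ is the solution of the reflected BSDE associated with upper barrier
$h_2(t, X^{\alpha,t_n,x_n}_t)\mathbf{1}_{t<\theta^{\alpha,n}} + u_*(\theta^{\alpha,n}, X^{\alpha,t_n,x_n}_{\theta^{\alpha,n}})\mathbf{1}_{t=\theta^{\alpha,n}}$ and $K$ is a positive constant which only depends
on $T$ and the Lipschitz constant of $f$.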
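Moreover, since the solution of the reflected BSDE with upper barrier is smaller than the solution
of a doubly reflected BSDE with the same upper barrier, we have

$$\tilde Y^{\alpha,t_n,x_n}_{t_n,\theta^{\alpha,n}}\big[h_2(t, X^{\alpha,t_n,x_n}_t)\mathbf{1}_{t<\theta^{\alpha,n}} + u_*(\theta^{\alpha,n}, X^{\alpha,t_n,x_n}_{\theta^{\alpha,n}})\mathbf{1}_{t=\theta^{\alpha,n}}\big] \le Y^{\alpha,t_n,x_n}_{t_n,\theta^{\alpha,n}}\big[u_*(\theta^{\alpha,n}, X^{\alpha,t_n,x_n}_{\theta^{\alpha,n}})\big].$$

Hence, using inequality (5.6), we get

$$\phi(t_n,x_n) \le Y^{\alpha,t_n,x_n}_{t_n,\theta^{\alpha,n}}\big[u_*(\theta^{\alpha,n}, X^{\alpha,t_n,x_n}_{\theta^{\alpha,n}})\big] - \delta_\varepsilon K. \qquad (5.7)$$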


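Now, $\phi$ is continuous with $\phi(t_0,x_0) = u_*(t_0,x_0)$, and the sequence $(t_n,x_n,u(t_n,x_n))$ converges
to $(t_0,x_0,u_*(t_0,x_0))$ as $n$ tends to $+\infty$. We can thus assume that $n$ is sufficiently large so that

$$|\phi(t_n,x_n) - u(t_n,x_n)| \le \frac{\delta_\varepsilon K}{2}.$$

By inequality (5.7), we thus get

$$u(t_n,x_n) \le Y^{\alpha,t_n,x_n}_{t_n,\theta^{\alpha,n}}\big[u_*(\theta^{\alpha,n}, X^{\alpha,t_n,x_n}_{\theta^{\alpha,n}})\big] - \frac{\delta_\varepsilon K}{2}.$$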

Now, by the super-optimality dynamic programming principle (4.4), since $\bar u_* \ge u_*$, we have
$u(t_n,x_n) \ge Y^{\alpha,t_n,x_n}_{t_n,\theta^{\alpha,n}}\big[u_*(\theta^{\alpha,n}, X^{\alpha,t_n,x_n}_{\theta^{\alpha,n}})\big]$. We thus obtain a contradiction. Hence, $u_*$ is a viscosity
supersolution of (5.1). It remains to prove that $u^*$ is a viscosity subsolution of (5.1). The proof,
which is based on quite similar arguments, is omitted. □


Remark 5.4. In the case of a Brownian framework and a continuous terminal reward, this property 
corresponds to a result shown in [10] by using a penalization approach. 

5.2 The continuous case 

Theorem 5.5. Suppose that the set A is a closed subset of $\mathbb{R}^p$ and that the continuity Assumption
4.4 holds. Then the value function $u$ is a viscosity solution in the classical sense, that is both a
viscosity sub- and super-solution of the generalized HJBVIs (5.1).

Proof. Since Assumption 4.4 holds, by Corollary 4.8, the value function $u$ is continuous with
respect to $(t,x)$, which implies that $u^* = u_* = u$. Moreover, by Theorems 4.3 and 5.2 (which do
not require the compactness of A in the continuous case), $u$ is a weak viscosity solution of the
generalized HJBVIs (5.1). The result thus follows. □


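A uniqueness result. Suppose Assumption 4.4 holds and A is compact. We assume moreover
that $E = \mathbb{R}^*$, $\mathcal{K} = \mathcal{B}(\mathbb{R}^*)$ and $\int_E (1 \wedge e^2)\,\nu(de) < +\infty$, and the following

Assumption 5.6. 1. $f(\alpha,t,x,y,z,k) := \bar f\big(\alpha,t,x,y,z,\int_E k(e)\gamma(x,e)\nu(de)\big)$ where
$\bar f : A \times [0,T] \times \mathbb{R}^4 \to \mathbb{R}$ is Borelian and satisfies:

(i) $|\bar f(\alpha,t,x,0,0,0)| \le C$, for any $x \in \mathbb{R}$, $t \in [0,T]$, $\alpha \in A$.

(ii) $|\bar f(\alpha,t,x,y,z,k) - \bar f(\alpha',t,x',y',z',k')| \le C(|\alpha - \alpha'| + |y - y'| + |z - z'| + |k - k'|)$, for
any $x, x' \in \mathbb{R}$, $t \in [0,T]$, $y, y' \in \mathbb{R}$, $z, z' \in \mathbb{R}$, $k, k' \in \mathbb{R}$, $\alpha, \alpha' \in A$.

(iii) $k \mapsto \bar f(\alpha,t,x,y,z,k)$ is non-decreasing, for any $(\alpha,t,x,y,z,k) \in A \times [0,T] \times \mathbb{R}^4$.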






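2. For each $R > 0$, there exists a continuous function $m_R : \mathbb{R}_+ \to \mathbb{R}_+$ such that $m_R(0) = 0$ and

$$|\bar f(\alpha,t,x,v,p,q) - \bar f(\alpha,t,y,v,p,q)| \le m_R(|x - y|(1 + |p|)),$$
for any $t \in [0,T]$, $|x|, |y|, |v| \le R$, $p, q \in \mathbb{R}$, $\alpha \in A$.

3. $\gamma : \mathbb{R} \times E \to \mathbb{R}$ is $\mathcal{B}(\mathbb{R}) \otimes \mathcal{B}(E)$-measurable,

$|\gamma(x,e) - \gamma(x',e)| \le C|x - x'|(1 \wedge e^2)$, $x, x' \in \mathbb{R}$, $e \in E$,
$|\gamma(x,e)| \le C(1 \wedge |e|)$ and $\gamma(x,e) \ge 0$, $e \in E$.

4. There exists $r > 0$ such that for any $x \in \mathbb{R}$, $t \in [0,T]$, $u, v \in \mathbb{R}$, $p \in \mathbb{R}$, $l \in \mathbb{R}$, $\alpha \in A$,

$\bar f(\alpha,t,x,v,p,l) - \bar f(\alpha,t,x,u,p,l) \ge r(u - v)$ when $u \ge v$.

5. $|g(x)| + |h(t,x)| \le C$, for any $x \in \mathbb{R}$, $t \in [0,T]$.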

Lemma 5.7 (Comparison principle). Suppose that the above assumptions hold. If $U$ is a bounded
viscosity subsolution and $V$ is a bounded viscosity supersolution of the HJBVIs (5.1) with $U(T,x) \le
g(x) \le V(T,x)$, $x \in \mathbb{R}$, then $U(t,x) \le V(t,x)$, for each $(t,x) \in [0,T] \times \mathbb{R}$.

The proof is similar to the case studied in [16] without controls and is thus omitted. 

Moreover, using this comparison principle together with Theorem 5.5, we derive the characterization
of $u$ as the unique solution of the HJBVIs.

Theorem 5.8 (Characterization of the value function). Suppose that the above assumptions hold. 
The value function u is then the unique viscosity solution of the HJBVIs (5.1) in the class of 
bounded continuous functions, in the sense that u is a viscosity sub and super solution of (5.1) with 
terminal condition u{T,x) = g{x). 
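The uniqueness part follows from the comparison principle by a symmetric argument, which can be sketched as follows. Here $v$ denotes a second, hypothetical bounded continuous viscosity solution of (5.1) with the same terminal condition; it is introduced only for this illustration.

```latex
% Let u and v be two bounded continuous viscosity solutions of (5.1) with
% u(T,x) = v(T,x) = g(x). Each is both a subsolution and a supersolution.
\begin{align*}
&\text{Lemma 5.7 with } (U,V) = (u,v): && u(t,x) \le v(t,x), \\
&\text{Lemma 5.7 with } (U,V) = (v,u): && v(t,x) \le u(t,x),
\end{align*}
% for every (t,x) in [0,T] x R, hence u = v.
```

Existence of such a solution is provided by Theorem 5.5, so the value function is the only element of this class.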

Using the comparison principle (Lemma 5.7), we show below some new results in the irregular case. 

6 Complementary results in the discontinuous case 

Using the previous results both in the discontinuous case and the continuous case, we show some 
additional properties of the value function when the terminal reward map is discontinuous. 

Let $g$ be a Borelian function such that there exist $K > 0$ and $p \in \mathbb{N}$ with $|g(x)| \le K(1 + |x|^p)$
and satisfying $h_1(T,x) \le g(x) \le h_2(T,x)$ for all $x \in \mathbb{R}$. Let $g_*$ be the l.s.c. envelope of $g$.

We denote by $u^g(t,x)$ the value function of our problem associated with terminal reward $g$.

Proposition 6.1. Let $(g_n)_{n \in \mathbb{N}}$ be a non decreasing sequence of Lipschitz continuous maps satisfying
for each $n$, $h_1(T,x) \le g_n(x) \le h_2(T,x)$ and $|g_n(x)| \le K(1 + |x|^p)$, $x \in \mathbb{R}$, and such that $g_* =
\lim_{n \to \infty} \uparrow g_n$ (which exists by analysis results). Then, we have

$$u^{g_*}(t,x) = \lim_{n \to \infty} \uparrow u^{g_n}(t,x).$$

Moreover, for each $n$, $u^{g_n}$ is continuous and $u^{g_*}$ is l.s.c.
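The existence of such an approximating sequence is a classical fact; one common construction is the inf-convolution $g_n(x) = \inf_y \{g_*(y) + n|x - y|\}$. The worked instance below is only a sketch: the step function is a hypothetical example, and the barrier constraint $h_1(T,x) \le g_n(x) \le h_2(T,x)$ from the proposition is ignored here (it would need a separate truncation argument).

```latex
% Hypothetical example: g(x) = 1_{x >= 0}, whose l.s.c. envelope is
% g_*(x) = 1_{x > 0}. Inf-convolution with slope n:
\[
g_n(x) := \inf_{y \in \mathbb{R}} \big\{ g_*(y) + n|x-y| \big\}
        = \min\big(1, (nx)^+\big).
\]
% Check: for x <= 0 take y = x, giving 0; for x > 0 the candidates are
% y = x (value 1) and y -> 0 with y <= 0 (value n x), so g_n(x) = min(1, n x).
% Each g_n is n-Lipschitz, g_n <= g_{n+1}, and
\[
\lim_{n \to \infty} \uparrow g_n(x) = \mathbf{1}_{\{x > 0\}} = g_*(x).
\]
```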
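Proof. By definition of the value function,

$$u^{g_*}(t,x) = \sup_{\alpha \in \mathcal{A}^t_t} Y^{\alpha,t,x}_{t,T}\big[g_*(X^{\alpha,t,x}_T)\big].$$

By the continuity property of doubly reflected BSDEs with respect to the terminal condition (see
Proposition 3.2) and by the comparison theorem, we have:

$$Y^{\alpha,t,x}_{t,T}\big[g_*(X^{\alpha,t,x}_T)\big] = \lim_{n \to \infty} \uparrow Y^{\alpha,t,x}_{t,T}\big[g_n(X^{\alpha,t,x}_T)\big] = \sup_{n \in \mathbb{N}} Y^{\alpha,t,x}_{t,T}\big[g_n(X^{\alpha,t,x}_T)\big].$$

From the two above equalities, we derive that

$$u^{g_*}(t,x) = \sup_{\alpha \in \mathcal{A}^t_t} \sup_{n \in \mathbb{N}} Y^{\alpha,t,x}_{t,T}\big[g_n(X^{\alpha,t,x}_T)\big] = \sup_{n \in \mathbb{N}} \sup_{\alpha \in \mathcal{A}^t_t} Y^{\alpha,t,x}_{t,T}\big[g_n(X^{\alpha,t,x}_T)\big] = \lim_{n \to \infty} \uparrow u^{g_n}(t,x).$$

By the continuity property of the value function in the continuous case (see Corollary 4.8), $u^{g_n}$ is
continuous because $g_n$ is continuous. Hence, $u^{g_*}$ is l.s.c. as supremum of continuous functions. □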


Corollary 6.2. We have:

1. If $g$ is l.s.c., then $u^g$ is l.s.c.

2. $u^g_* \ge u^{g_*}$.

3. The l.s.c. envelope $u^g_*$ of the value function $u^g$ is a viscosity supersolution of the HJBVIs (5.1),
with terminal value greater than $g_*$.

Proof. 1. Since $g$ is l.s.c., $g_* = g$. Hence, Point 1. directly follows from the above Proposition.

2. As $g \ge g_*$, we have $u^g \ge u^{g_*}$. By Point 1, $u^{g_*}$ is l.s.c., and we thus obtain $u^g_* \ge u^{g_*}$.

3. By Point 2., $u^g_*(T,x) \ge u^{g_*}(T,x) = g_*(x)$. Using Theorem 5.2, we derive that $u^g_*$ is a viscosity
supersolution of the HJBVIs (5.1), with terminal value greater than $g_*$. □

Corollary 6.3. Suppose that A is compact. If $g$ is l.s.c., then $u^g$ is l.s.c., and is a viscosity
supersolution of the HJBVIs (5.1) with terminal condition $u^g(T,x) = g(x)$, $x \in \mathbb{R}$.

Proof. By Point 1. of Corollary 6.2, as $g$ is l.s.c., $u^g$ is l.s.c. Hence, by Theorem 5.2, $u^g$ is a
viscosity supersolution of (5.1). □

Proposition 6.4. Suppose that A is compact and Assumption 5.6 holds. Let $v$ be a viscosity
supersolution of the HJBVIs (5.1) with $v(T,x) \ge g_*(x)$, $x \in \mathbb{R}$. We then have $v \ge u^{g_*}$. In other
words, $u^{g_*}$ is the minimal viscosity supersolution of the HJBVIs (5.1) with terminal value greater
than $g_*$.

Proof. Let $(g_n)_{n \in \mathbb{N}}$ be a non decreasing sequence of continuous maps satisfying the assumptions
of Proposition 6.1. For each $n$, we have $v(T,x) \ge g_n(x)$, $x \in \mathbb{R}$. As $u^{g_n}$ is a continuous viscosity
solution, it is a subsolution. We also have $u^{g_n}(T,x) = g_n(x)$, $x \in \mathbb{R}$. By using the comparison
principle (see Lemma 5.7), we have $v \ge u^{g_n}$ for all $n$. Hence, we get

$$v \ge \lim_{n \to \infty} \uparrow u^{g_n} = u^{g_*},$$

where the last equality follows from Proposition 6.1. □

By the above result, we have the following characterization of the value function when g is l.s.c. 

Theorem 6.5. Suppose A is compact and Assumption 5.6 holds. If $g$ is l.s.c., then the value
function $u^g$ of our mixed problem is l.s.c. and it is the minimal viscosity supersolution, with
terminal value greater than $g$, of the HJBVIs (5.1).
A Appendix


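A.1 A decomposition property

We show a property which ensures that under our assumptions (in particular, $h_1$ or $h_2$ is $C^{1,2}$),
for each $t \in [0,T]$ and for each $\alpha \in \mathcal{A}^t_t$, the barriers $h_1(s, X^{\alpha,t,x}_s)\mathbf{1}_{s<T} + g(X^{\alpha,t,x}_T)\mathbf{1}_{s=T}$ and
$h_2(s, X^{\alpha,t,x}_s)\mathbf{1}_{s<T} + g(X^{\alpha,t,x}_T)\mathbf{1}_{s=T}$ satisfy Mokobodzki's condition (see (2.7)).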

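Proposition A.1. Let $h : [0,T] \times \mathbb{R} \to \mathbb{R}$ be $C^{1,2}$ with bounded derivatives, and let $g$ be a
Borelian map. Suppose that $|h(t,x)| + |g(x)| \le C(1 + |x|^p)$. Let $t \in [0,T]$ and $\alpha \in \mathcal{A}^t_t$. There exist
two nonnegative $\mathbb{F}^t$-supermartingales $H^{\alpha,t,x}$ and $H'^{\alpha,t,x}$ in $\mathcal{S}^2$ such that

$$h(s, X^{\alpha,t,x}_s)\mathbf{1}_{s<T} + g(X^{\alpha,t,x}_T)\mathbf{1}_{s=T} = H^{\alpha,t,x}_s - H'^{\alpha,t,x}_s, \qquad t \le s \le T.$$
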
Proof. Set ■= h{s, Xs’^’^)ls<T + 9{X^’^’^)1 s=t- By applying Ito’s formula to /i(s, Af’*’*), 
one can show that there exist three F*-predictable processes (fr), (Tr) and ((^^(r,.)) such that 
d/i(r, A"’*’"') = frdr+!fldW^+ f^(p‘^{r,e)dN\dr,de), with \fr\-\-\(pl\ + \\(p‘^{r, .)||i, < A(1 +|A"i*’"'|p). 
Integrating this equation between s > t and T, and then taking the conditional expectation with 
respect to Aj, we derive that 


h{s,Xf^^n = E[HT,X^’^’^)+ / /.dr I A*] =7,-7', 


where (7^) and (7') are defined for each s £ [t,T] by Ig ■= E[hf (T, X!^'^’^) + dr \iFl] and 

7' := E[hi {T, X!^'^’^) + /,7dr | 7/]. They are both nonnegative supermartingales belonging to 

Si - For each s £ [t,T], we have h{s, Xg’*’^) =7^ — 7' . For each s £ [t,T], set 

L:=Cs-E[g{X^’^’^)\El]. (A.l) 

Let us now show that there exist two nonnegative supermartingales H and H' such that 


Hrp = h't = 0 and lg = Hg-H'„ t<s<T. 


(A.2) 


By (A.l), we have = 0. We thus have that for each s £ [t,T], 


6 = {his,Xf’^n - E[g{X!^’^n\El])ls<T = {Is -I'g- E\g{X^I'^)\Fl])l 


s<T- 


Now, the processes {Ig + E[g~{X^I’^)\Fl])t<s<T and {I'g + E[g'^{X^I’^)\J^])t<s<T are nonnegative 
supermartingales as the sum of two nonnegative supermartingales. 

It follows that the processes {Hs)t<g<T and {II'g)t<s<T defined by 

Hg := (7, + A[<7-(A“’‘’")|A^])l,<r and A' := (7( + A[<7+(A“’*’")|A^])l,<r, 7 < s < A, are also 
nonnegative supermartingales, which are moreover equal to 0 at time T. They satisfy the equality 
^g = Hg — 77' for each s £ [t,T]. Hence, 77 and 77' satisfy equalities (A.2). 

Using equality (A.l), we derive that the processes 77 and 77' defined by Hg := As+A[ 5 r+(A^’*’^)|A^] 
and 77' := A' + Ab-(A“’*’")|A^] are nonnegative supermartingales satisfying ^g = 77^ — 77', 
t < s < T, which ends the proof. □ 


Note that we cannot apply Itô's formula to $\bar h(s, X^{\alpha,t,x}_s)$ with $\bar h(s,x) := h(s,x)\mathbf{1}_{s<T} + g(x)\mathbf{1}_{s=T}$ since
$g$ is irregular. The change of variable (A.1) allows us to deal more easily with the irregularity of $g$.
A.2 Measurability and splitting properties

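For each $\omega \in \Omega$, let ${}^s\omega := (\omega_{r \wedge s})_{0 \le r \le T}$ and $\omega^s := (\omega_r - \omega_s)_{s \le r \le T}$. Let $S^s$ (resp. $T^s$) be the operator defined on $\Omega$ by
$S^s(\omega) := {}^s\omega$ (resp. $T^s(\omega) := \omega^s$). Note that $S^s$ and $T^s$ are independent and for each $\omega \in \Omega$ we have
$\omega = S^s(\omega) + T^s(\omega)\mathbf{1}_{]s,T]} = {}^s\omega + \omega^s\mathbf{1}_{]s,T]}$. When there is no ambiguity, we identify $\omega$ with $({}^s\omega, \omega^s)$.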

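Let $t \in [0,T]$, $\alpha \in \mathcal{A}^t_t$ and $s \ge t$. For P-almost every $\omega$, the process $\alpha({}^s\omega, T^s)$ (denoted also by
$\alpha({}^s\omega, \cdot)$) defined by

$$\alpha({}^s\omega, T^s) : \Omega \times [s,T] \to \mathbb{R}; \quad (\omega', r) \mapsto \alpha_r({}^s\omega, T^s(\omega'))$$

belongs to $\mathcal{A}^s_s$ (see Lemma 3.3 in [17]).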

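Recall that the forward process $X^{\alpha,t,x}$ satisfies the splitting property (see [17]): for all $t \in [0,T]$, $\alpha \in
\mathcal{A}^t_t$ and $s \ge t$, for almost every $\omega \in \Omega$, setting $\bar\omega = {}^s\omega$, the process $X^{\alpha,t,x}(\bar\omega, T^s)$ (denoted also by
$X^{\alpha,t,x}(\bar\omega, \cdot)$) coincides with the solution of the forward SDE on $\Omega \times [s,T]$, driven by $W^s$ and $\tilde N^s$,
associated with control $(\alpha_r(\bar\omega, \cdot))_{r \ge s}$ and initial condition $X^{\alpha,t,x}_s(\bar\omega)$ at time $s$.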

By similar arguments as in [17], we have an analogous property for doubly reflected BSDEs: 


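Proposition A.2. (Splitting property for doubly reflected BSDEs)

Let $t \in [0,T]$, $\alpha \in \mathcal{A}^t_t$ and $s \in [t,T]$. For almost every $\omega \in \Omega$, setting $\bar\omega = {}^s\omega$, the process
$(Y^{\alpha,t,x}_r(\bar\omega, \cdot))_{s \le r \le T}$ coincides with the solution of the doubly reflected BSDE on $\Omega \times [s,T]$, associated
with driver $f^{\alpha(\bar\omega,\cdot),\, s,\, X^{\alpha,t,x}_s(\bar\omega)}$, barriers $h_i\big(r, X^{\alpha(\bar\omega,\cdot),\, s,\, X^{\alpha,t,x}_s(\bar\omega)}_r\big)_{r<T}$, terminal condition
$g\big(X^{\alpha(\bar\omega,\cdot),\, s,\, X^{\alpha,t,x}_s(\bar\omega)}_T\big)$, filtration $\mathbb{F}^s$, and driven by $W^s$ and $\tilde N^s$. In particular, we have

$$Y^{\alpha,t,x}_{s,T}(\bar\omega) = u^{\alpha(\bar\omega,\cdot)}\big(s, X^{\alpha,t,x}_s(\bar\omega)\big). \qquad (A.3)$$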


Let $\eta \in L^2(\mathcal{F}^t_s)$. Since $\eta$ is $\mathcal{F}^t_s$-measurable, up to a P-null set, it can be written as a measurable
map, still denoted by $\eta$, of the past trajectory ${}^s\omega$. For each $\omega \in \Omega$, by using the definition of the
function $u$, we have:

$$u(s, \eta({}^s\omega)) = \sup_{\alpha \in \mathcal{A}^s_s} u^{\alpha}(s, \eta({}^s\omega)). \qquad (A.4)$$

For each $(t,s)$ with $s \ge t$, we introduce the set $\mathcal{A}^t_s$ of restrictions to $[s,T]$ of the controls in $\mathcal{A}^t_t$.
They can also be identified to the controls $\alpha$ in $\mathcal{A}^t_t$ which are equal to 0 on $[t,s]$.

Using the measurability and continuity results of Section 2.1, we show the following result. 

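Proposition A.3. Let $t \in [0,T]$, $s \in [t,T]$ and $\eta \in L^2(\mathcal{F}^t_s)$. Let $\varepsilon > 0$. Suppose that A is compact.
There exists $\alpha^{\varepsilon} \in \mathcal{A}^t_s$ such that, for almost every $\omega \in \Omega$,

$$u(s, \eta({}^s\omega)) \le u^{\alpha^{\varepsilon}({}^s\omega,\cdot)}(s, \eta({}^s\omega)) + \varepsilon. \qquad (A.5)$$

Moreover, the $\varepsilon$-optimal control $\alpha^{\varepsilon}$ can be constructed so that it depends on the past trajectory ${}^s\omega$
only through $\eta({}^s\omega)$.

If Assumption 4.4 holds, this result still holds when A is a nonempty closed subset of $\mathbb{R}$, not
necessarily compact.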

Proof. To simplify notation, we suppose that $t = 0$. We introduce $\Omega^s$, the set of the restrictions
to $[s,T]$ of the paths $\omega \in \Omega$. By Lemma 4.1, the map $(x,\alpha) \mapsto u^{\alpha}(s,x)$ is $\mathcal{B}(\mathbb{R}) \otimes \mathcal{B}(\mathcal{A}^s_s)$-measurable.
By Proposition 7.50 in [5] together with a result of measure theory (see e.g. Lemma A.3 in
[17]), we derive that there exists a Borelian map : ffi —>■ Af ; x ^ a^{x, •) such that 

u{s, x) < x) + e for Q — almost every x G M, (A.6) 

where Q is the law of t] under P. Let {e*,i G N} be a countable orthonormal basis of the sepa¬ 
rable Hilbert space For each x, the map a^(x,-}, which belongs to admits the following 
decomposition: q;^(x, •) = dP <S> du-a.s. and in where / 3 *’^ : M M is a Borelian 

map defined by /3®’^(x) =< q;^(x, •), e®(-) >112. Consider the predictable process dehned for each 
(r, w) G [s,T] X H by a^uj) := (w^). Since A is bounded, is bounded, and hence 

belongs to and thus to As, which gives the desired result. 

When A is only a nonempty closed subset of M, not necessarily bounded, the control cf can 
also be supposed to be bounded because the set of bounded controls in denoted by Al’^, is 
dense in Indeed, let a G A. If a G Af, the bounded controls arl\a^\<n + ®l|ar|>n belong to A* 
and converge to a in as n tends to +00. 

Now, by the continuity Assumption 4.4, u" is continuous with respect to a (see Lemma 4.5). Hence, 

u{s,x) = sup u"(s,x) = sup u"(s,x). 

It follows that there exists a bounded Borelian map : M —>■ A|; x 1—>■ d^(x,-) which satisfies 
inequality (A.6). The proof is thus complete. □ 

By applying this property to $\varepsilon = \frac{1}{n}$ for each $n \in \mathbb{N}$, we derive that under Assumption 4.4, there
exists an "optimizing sequence" for the value function $u(s, \eta(\cdot))$.

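Corollary A.4. Suppose Assumption 4.4 holds. Let $t \in [0,T]$, $s \in [t,T]$ and $\eta \in L^2(\mathcal{F}^t_s)$. There
exists a sequence $(\alpha^n)_{n \in \mathbb{N}}$ in $\mathcal{A}^t_s$ such that for almost every $\omega \in \Omega$,

$$u(s, \eta({}^s\omega)) = \lim_{n \to \infty} u^{\alpha^n({}^s\omega,\cdot)}(s, \eta({}^s\omega)).$$

Moreover, the processes $\alpha^n$, $n \in \mathbb{N}$, can be chosen so that they depend on the past trajectory ${}^s\omega$
only through $\eta({}^s\omega)$.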


A.3 Complementary results on BSDEs and reflected BSDEs with jumps 

We first state a version of the Fatou lemma for $\mathcal{E}^f$-conditional expectations (or equivalently for BSDEs)
where the limit involves both terminal condition and terminal time.

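Lemma A.5 (A Fatou lemma for BSDEs). Let $T > 0$. Let $f$ be a given Lipschitz driver satisfying Assumption 3.1. Let $(\theta^n)_{n \in \mathbb{N}}$ be a nonincreasing sequence of stopping times in $\mathcal{T}$, converging a.s. to $\theta \in \mathcal{T}$ as $n$ tends to $\infty$. Let $(\xi^n)_{n \in \mathbb{N}}$ be a sequence of random variables such that $\mathbb{E}[\sup_n (\xi^n)^2] < +\infty$, and for each $n$, $\xi^n$ is $\mathcal{F}_{\theta^n}$-measurable. Then, for each stopping time $\tau$ with $\tau \le \theta$ a.s., we have

\[ \mathcal{E}^f_{\tau,\theta}\big(\liminf_{n \to +\infty} \xi^n\big) \le \liminf_{n \to +\infty} \mathcal{E}^f_{\tau,\theta^n}(\xi^n) \quad \text{and} \quad \mathcal{E}^f_{\tau,\theta}\big(\limsup_{n \to +\infty} \xi^n\big) \ge \limsup_{n \to +\infty} \mathcal{E}^f_{\tau,\theta^n}(\xi^n) \quad \text{a.s.} \]
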
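Proof. Let us show the first inequality. For each $n$, by the nondecreasing property of $\mathcal{E}^f$, we have $\mathcal{E}^f_{\tau,\theta^n}(\inf_{p \ge n} \xi^p) \le \mathcal{E}^f_{\tau,\theta^n}(\xi^n)$ a.s. Thus, $\liminf_{n \to +\infty} \mathcal{E}^f_{\tau,\theta^n}(\inf_{p \ge n} \xi^p) \le \liminf_{n \to +\infty} \mathcal{E}^f_{\tau,\theta^n}(\xi^n)$ a.s. Since $\lim_{n \to +\infty} \inf_{p \ge n} \xi^p = \liminf_{n \to +\infty} \xi^n$ a.s., by a continuity property of the $\mathcal{E}^f$-conditional expectation (see Proposition A.6 in [31]), we get $\mathcal{E}^f_{\tau,\theta}(\liminf_{n \to +\infty} \xi^n) = \lim_{n \to +\infty} \mathcal{E}^f_{\tau,\theta^n}(\inf_{p \ge n} \xi^p)$ a.s., from which the desired inequality follows. The proof of the second one is similar. □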
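
For completeness, the second inequality can be sketched along the same lines. This is only a sketch under the assumptions of Lemma A.5: it uses the same monotonicity and continuity properties of $\mathcal{E}^f$ as the first inequality, and writes $\mathcal{E}^f_{\tau,\theta}(\cdot)$ for the $\mathcal{E}^f$-conditional expectation at time $\tau$ with terminal time $\theta$. All relations hold a.s.

```latex
% Sketch of the second inequality of the Fatou lemma.
% For p >= n we have theta^p <= theta^n, so sup_{p >= n} xi^p is F_{theta^n}-measurable,
% and it is square integrable because E[ sup_n (xi^n)^2 ] < +infty.
\begin{align*}
\mathcal{E}^f_{\tau,\theta^n}(\xi^n)
  &\le \mathcal{E}^f_{\tau,\theta^n}\Big(\sup_{p \ge n} \xi^p\Big)
  && \text{(monotonicity of } \mathcal{E}^f\text{)},\\
\limsup_{n \to +\infty} \mathcal{E}^f_{\tau,\theta^n}(\xi^n)
  &\le \lim_{n \to +\infty} \mathcal{E}^f_{\tau,\theta^n}\Big(\sup_{p \ge n} \xi^p\Big)
  && \text{(the limit exists by continuity)},\\
  &= \mathcal{E}^f_{\tau,\theta}\Big(\limsup_{n \to +\infty} \xi^n\Big)
  && \text{(since } \sup_{p \ge n} \xi^p \downarrow \limsup_{n} \xi^n\text{)}.
\end{align*}
```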

Using this lemma, we now show a continuity property of the solutions of reflected BSDEs with respect to both terminal time and terminal condition, which extends the result established in [17] at time 0 (see Lemma 3.10) to any time $\tau \in \mathcal{T}$.

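Lemma A.6 (A continuity property for reflected BSDEs). Let $T > 0$. Let $(\eta_t)$ be an RCLL process in $\mathcal{S}^2$. Let $f$ be a given Lipschitz driver satisfying Assumption 3.1. Let $(\theta^n)_{n \in \mathbb{N}}$ be a nonincreasing sequence of stopping times in $\mathcal{T}$, converging a.s. to $\theta \in \mathcal{T}$ as $n$ tends to $\infty$. Let $(\xi^n)_{n \in \mathbb{N}}$ be a sequence of random variables such that $\mathbb{E}[\operatorname{ess\,sup}_n (\xi^n)^2] < +\infty$, and for each $n$, $\xi^n$ is $\mathcal{F}_{\theta^n}$-measurable. Suppose that $\xi^n$ converges a.s. to an $\mathcal{F}_{\theta}$-measurable random variable $\xi$ as $n$ tends to $\infty$. Suppose that

\[ \eta_{\theta} \le \xi \quad \text{a.s.} \tag{A.7} \]

Let $Y_{\cdot,\theta^n}(\xi^n)$; $Y_{\cdot,\theta}(\xi)$ be the solutions of the reflected BSDEs associated with driver $f$, obstacle $(\eta_s)_{s < \theta^n}$ (resp. $(\eta_s)_{s < \theta}$), terminal time $\theta^n$ (resp. $\theta$), terminal condition $\xi^n$ (resp. $\xi$). Then, for each stopping time $\tau$ with $\tau \le \theta$ a.s.,

\[ Y_{\tau,\theta}(\xi) = \lim_{n \to +\infty} Y_{\tau,\theta^n}(\xi^n) \quad \text{a.s.} \]

When for each $n$, $\theta^n = \theta$ a.s., the result still holds without Assumption (A.7).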

Proof. Note that the case where $\tau = 0$ has been solved in [17] by using the classical a priori estimates on reflected BSDEs. The case where $\tau$ is any stopping time requires some additional arguments. It could be shown by using Itô's calculus again. We adopt here a less classical approach which requires fewer computations.

Step 1: Let us first consider the simpler case when for each $n$, $\theta^n = \theta$ a.s.

Using the a priori estimates on reflected BSDEs provided in [15] (see Proposition A.1 in [15]) and the convergence of $\xi^n$ to $\xi$, one can show that $Y_{\tau,\theta}(\xi) = \lim_{n \to +\infty} Y_{\tau,\theta}(\xi^n)$ a.s.

Step 2: Let us now consider the general case. The difficulty here is to deal with the variation of the terminal time together with the presence of the obstacle $(\eta_t)$. In particular, Proposition A.1 in [15] is not appropriate to this case. By the flow property for reflected BSDEs, we have:

\[ Y_{\tau,\theta^n}(\xi^n) = Y_{\tau,\theta}\big(Y_{\theta,\theta^n}(\xi^n)\big). \]

By Step 1, it is thus sufficient to show that $\lim_{n \to \infty} Y_{\theta,\theta^n}(\xi^n) = \xi$ a.s.

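Since the solution of the reflected BSDE associated with terminal condition $\xi^n$ and terminal time $\theta^n$ is greater than the solution of the nonreflected BSDE associated with terminal condition $\xi^n$ and terminal time $\theta^n$, we have:

\[ Y_{\theta,\theta^n}(\xi^n) \ge \mathcal{E}^f_{\theta,\theta^n}(\xi^n) \quad \text{a.s.} \tag{A.8} \]

Now, by the continuity property of BSDEs with respect to both terminal time and terminal condition (Proposition A.6 in [31]), we have $\lim_{n \to \infty} \mathcal{E}^f_{\theta,\theta^n}(\xi^n) = \mathcal{E}^f_{\theta,\theta}(\xi) = \xi$ a.s. Taking the liminf in (A.8), we obtain:

\[ \liminf_{n \to \infty} Y_{\theta,\theta^n}(\xi^n) \ge \lim_{n \to \infty} \mathcal{E}^f_{\theta,\theta^n}(\xi^n) = \xi \quad \text{a.s.} \]
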
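It remains to show that:

\[ \limsup_{n \to \infty} Y_{\theta,\theta^n}(\xi^n) \le \xi \quad \text{a.s.} \tag{A.9} \]

By the characterization of the solution of a reflected BSDE, we obtain:

\[ Y_{\theta,\theta^n}(\xi^n) = \operatorname*{ess\,sup}_{\tau \in \mathcal{T}_{\theta}} \mathcal{E}^f_{\theta,\tau \wedge \theta^n}\big(\eta_{\tau} \mathbf{1}_{\{\tau < \theta^n\}} + \xi^n \mathbf{1}_{\{\tau \ge \theta^n\}}\big). \]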

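Fix $\varepsilon > 0$. By the second assertion of Theorem 3.3 in [32], there exists an $\varepsilon$-optimal stopping time $\tau^{\varepsilon} \in \mathcal{T}_{\theta}$, that is, such that

\[ Y_{\theta,\theta^n}(\xi^n) \le \mathcal{E}^f_{\theta,\tau^{\varepsilon} \wedge \theta^n}\big(\eta_{\tau^{\varepsilon}} \mathbf{1}_{\{\tau^{\varepsilon} < \theta^n\}} + \xi^n \mathbf{1}_{\{\tau^{\varepsilon} \ge \theta^n\}}\big) + \varepsilon \quad \text{a.s.} \tag{A.10} \]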

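Note that we have the following property: let $X$, $X'$, $X_n$, $X'_n$, $n \in \mathbb{N}$, be real valued random variables with $X \le X'$ a.s., and let $A_n$, $n \in \mathbb{N}$, be measurable sets of $\mathcal{F}_T$.

\[ \text{If } X_n \to X \text{ and } X'_n \to X' \text{ a.s., then } \limsup_{n \to \infty} \big(X_n \mathbf{1}_{A_n} + X'_n \mathbf{1}_{A_n^c}\big) \le X' \quad \text{a.s.} \tag{A.11} \]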
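
Property (A.11) follows from an elementary pointwise bound. A short sketch, where $X_n$, $X'_n$ denote the two sequences, $X \le X'$ their a.s. limits and $A_n$ the measurable sets (these names are chosen here for illustration):

```latex
% On A_n the sum equals X_n, on its complement it equals X'_n,
% so in both cases it is bounded by the maximum of the two.
\begin{align*}
X_n \mathbf{1}_{A_n} + X'_n \mathbf{1}_{A_n^c} &\le \max(X_n, X'_n),\\
\limsup_{n \to \infty} \big(X_n \mathbf{1}_{A_n} + X'_n \mathbf{1}_{A_n^c}\big)
  &\le \lim_{n \to \infty} \max(X_n, X'_n) = \max(X, X') = X' \quad \text{a.s.}
\end{align*}
```

The last equality uses $X \le X'$ a.s., and the limit of the maxima exists because $(x, y) \mapsto \max(x, y)$ is continuous.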

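Now, for each $n$, $\tau^{\varepsilon} \wedge \theta^n \ge \theta$ a.s. and $\tau^{\varepsilon} \wedge \theta^n$ tends to $\theta$ a.s. as $n \to +\infty$. Hence, by the right-continuity property of the obstacle $(\eta_t)$, we get $\eta_{\tau^{\varepsilon} \wedge \theta^n} \to \eta_{\theta} \le \xi$ a.s., where the last inequality holds by Assumption (A.7). By applying Property (A.11) and since $\xi^n \to \xi$ a.s., we thus obtain

\[ \limsup_{n \to \infty} \big(\eta_{\tau^{\varepsilon}} \mathbf{1}_{\{\tau^{\varepsilon} < \theta^n\}} + \xi^n \mathbf{1}_{\{\tau^{\varepsilon} \ge \theta^n\}}\big) \le \xi \quad \text{a.s.} \]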

Now, the Fatou property for BSDEs (see Lemma A.5) together with (A.10) implies that $\limsup_{n \to \infty} Y_{\theta,\theta^n}(\xi^n) \le \xi + \varepsilon$ a.s. Since this inequality holds for each $\varepsilon > 0$, we derive inequality (A.9). The proof is thus complete. □
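
The last step of the proof can be written out as a chain of inequalities. This is a sketch only: $Z_n$ is a shorthand introduced here for the random variable inside the $\mathcal{E}^f$-conditional expectation in (A.10), and the Fatou lemma is applied with the terminal times $\tau^{\varepsilon} \wedge \theta^n$, which decrease to $\theta$.

```latex
% Z_n is hypothetical shorthand:
% Z_n := eta_{tau^eps} 1_{tau^eps < theta^n} + xi^n 1_{tau^eps >= theta^n}.
\begin{align*}
\limsup_{n \to \infty} Y_{\theta,\theta^n}(\xi^n)
  &\le \limsup_{n \to \infty} \mathcal{E}^f_{\theta,\tau^{\varepsilon} \wedge \theta^n}(Z_n) + \varepsilon
  && \text{by (A.10)},\\
  &\le \mathcal{E}^f_{\theta,\theta}\Big(\limsup_{n \to \infty} Z_n\Big) + \varepsilon
  && \text{by Lemma A.5},\\
  &= \limsup_{n \to \infty} Z_n + \varepsilon
  && \text{since } \mathcal{E}^f_{\theta,\theta}(\zeta) = \zeta \text{ for } \mathcal{F}_{\theta}\text{-measurable } \zeta,\\
  &\le \xi + \varepsilon
  && \text{by (A.11) and (A.7)}.
\end{align*}
```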